Tag: Exadata infiniband switch

  • Change root User Password on Exadata Infiniband Switch

    Change root User Password on Exadata Infiniband Switch

    I was working on changing password for the administrative user accounts on all Exadata Components. I encountered a strange issue while changing the root password on Infiniband Switch. We were unable to change the root password on IB Siwtch using command line method. We used couple different command line methods to change the root password on IB switches but all of them failed. This could be a BUG, firmware issue or something else.

    In this article we demonstrate how to change the root password on an Exadata infiniband switch using Browser User Interface.

    Issue 1: Using passwd command

    Tried to change the root user password using passwd command using dcli. This method assumes you are have ssh equivalence setup from compute node 1. As you can see the command failed saying to use the ILOM shell. In the past I have used the same command successfully to change the root password on IB Switches.

    [root@dm01db01 ~]#  dcli -g ibswitch_group -l root “echo welcome1 | passwd –stdin root”
    dm01sw-ibb01: This command should not be used for ILOM users.
    dm01sw-ibb01: Please use ILOM shell to handle password for this user.
    dm01sw-ibb01: Example:
    dm01sw-ibb01: -> set /SP/users/root password
    dm01sw-ibb01:
    dm01sw-iba01: This command should not be used for ILOM users.
    dm01sw-iba01: Please use ILOM shell to handle password for this user.
    dm01sw-iba01: Example:
    dm01sw-iba01: -> set /SP/users/root password
    dm01sw-iba01:

    So I decided to login to the IB switch directly and use the passwd command instead of running from dcli. The passwd command fail again with the same error.

    [root@dm01sw-iba01 ~]# ssh dm01sw-ibb01
    You are now logged in to the root shell.
    It is recommended to use ILOM shell instead of root shell.
    All usage should be restricted to documented commands and documented
    config files.
    To view the list of documented commands, use “help” at linux prompt.

    [root@dm01sw-ibb01 ~]# hostname
    dm01sw-ibb01

    [root@dm01sw-iba01 ~]# passwd root
    This command should not be used for ILOM users.
    Please use ILOM shell to handle password for this user.
    Example:
       -> set /SP/users/root password

    eBook - Oracle Exadata X8M Patching Recipes | Netsoftmate

    Issue 2: Using ILOM Shell

    As the passwd command failed asking to use the ILOM shell, I login to the IB switch as ilom-admin and executed the change password command. What I see is, the password change command failed at ILOM prompt as well.

    [root@dm01sw-iba01 ~]# su – ilom-admin

    Oracle(R) Integrated Lights Out Manager
    Version 2.2.7-1 ILOM 3.2.6 r118629
    Copyright (c) 2017, Oracle and/or its affiliates. All rights reserved.
    Warning: HTTPS certificate is set to factory default.
     
    Hostname: dm01sw-iba01

    -> set /SP/users/root welcome1
    set: Invalid command syntax
    Usage: set [-script] [target] <property>=<value> [<property>=<value>…]


     

    Solution: Using Browser User Interface

    I have decided to use the BUI to change the password.

    Steps:

    • Open a Browser and enter the IB Switch hostname or IP address

    https://dm01sw-ibb01.netsoftmate.com

    • Accept the security warning and proceed to connect to the IB Switch
    • Enter the username and password to connect to the IB Switch

    • This show the summary page


    • On the left Pan, expand ILOM administration and select User Management


    • Click on  User Accounts, Select root user and click on edit button


    • Enter the new password and confirm and Finally click on the Save button to change the password.


    • To Verify the new password, open a Putty session and ssh to IB Switch using new password.

    [root@dm01db01 ~]# ssh dm01sw-ibb01
    Password:
    You are now logged in to the root shell.
    It is recommended to use ILOM shell instead of root shell.
    All usage should be restricted to documented commands and documented
    config files.
    To view the list of documented commands, use “help” at linux prompt.

    [root@dm01sw-ibb01 ~]# hostname
    dm01sw-ibb01

    Conclusion

    In this article we have learned how to change the root password on Infiniband Switch using Browser User Interface when the command line option doesn’t work.

  • How To Clear Hardware Fault on Exadata Infiniband Switch Manually

    Introduction

    We had a FAN failure on Exadata Infiniband Switch (FAN2). Scheduled the faulty hardware replacement with Oracle. The Oracle Feild Engineer came to the Customer Data Center and replaced the faulty FAN on Infiniband Switch. The FAN replacement was successful however the fault was not cleared automatically. We can still see the FAN was marked faulted from Infiniband BUI and CLI.

    From Infiniband Browser User Interface


    In this article we will demonstrate how to clear the fault on Infiniband Switch after hardware replacement.


    • Login to the Infiniband switch using Putty as root user and check the Infiniband health. From the output below we can see the FANs are all good.

    [root@dm01sw-iba01 ~]# env_test
    Environment test started:
    Starting Environment Daemon test:
    Environment daemon running
    Environment Daemon test returned OK
    Starting Voltage test:
    Voltage ECB OK
    Measured 3.3V Main = 3.28 V
    Measured 3.3V Standby = 3.39 V
    Measured 12V = 11.97 V
    Measured 5V = 5.02 V
    Measured VBAT = 3.14 V
    Measured 2.5V = 2.49 V
    Measured 1.8V = 1.79 V
    Measured I4 1.2V = 1.22 V
    Voltage test returned OK
    Starting PSU test:
    PSU 0 present OK
    PSU 1 present OK
    PSU test returned OK
    Starting Temperature test:
    Back temperature 40
    Front temperature 41
    SP temperature 57
    Switch temperature 55, maxtemperature 59
    Temperature test returned OK
    Starting FAN test:
    Fan 0 not present
    Fan 1 running at rpm 17004
    Fan 2 running at rpm 15696
    Fan 3 running at rpm 17004
    Fan 4 not present
    FAN test returned OK
    Starting Connector test:
    Connector test returned OK
    Starting Onboard ibdevice test:
    Switch OK
    All Internal ibdevices OK
    Onboard ibdevice test returned OK
    Starting SSD test:
    SSD test returned OK
    Starting Auto-link-disable test:
    Auto-link-disable test returned OK
    Environment test PASSED

    • Check the FAN Speed. FAN looks good.

    [root@dm01sw-iba01 ~]# getfanspeed
    Fan 0 not present
    Fan 1 running at rpm 17004
    Fan 2 running at rpm 15478
    Fan 3 running at rpm 17004
    Fan 4 not present

    • Switch to the ilom-admin user

    [root@dm01sw-iba01 ~]# su – ilom-admin

    Oracle(R) Integrated Lights Out Manager

    Version 2.2.9-3 ILOM 3.2.11 r124039

    Copyright (c) 2018, Oracle and/or its affiliates. All rights reserved.

    Warning: HTTPS certificate is set to factory default.

    Hostname: dm01sw-iba01.netsoftmate.com

    ->

    • Now check the fault table for any faulty components. Now we can see the FAN2 is Faulted though the FAN was replaced with a new FAN.

    -> show / -a -l 4 -o table fault_state
    Target                                  | Property                                     | Value
    —————————————-+———————————————-+——————————————————————–
    /SYS                                    | fault_state                                  | OK
    /SYS/MB                                 | fault_state                                  | OK
    /SYS/PSU0                               | fault_state                                  | OK
    /SYS/PSU1                               | fault_state                                  | OK
    /SYS/FAN1                               | fault_state                                  | OK
    /SYS/FAN2                               | fault_state                                  | Faulted /SYS/FAN3                               | fault_state                                  | OK

    ->

    • You can also execute the below command to identify the fault

    -> show -d targets /SP/faultmgmt

     /SP/faultmgmt
        Targets:
            shell
            0 (/SYS/FAN2)

    • Clear the Fault as show below

    -> set /SYS/FAN2 clear_fault_action=true
    Are you sure you want to clear /SYS/FAN2 (y/n)? y
    Set ‘clear_fault_action’ to ‘true’

    • Verify the fault is cleared

    -> show / -a -l 4 -o table fault_state
    Target                                  | Property                                     | Value
    —————————————-+———————————————-+——————————————————————–
    /SYS                                    | fault_state                                  | OK
    /SYS/MB                                 | fault_state                                  | OK
    /SYS/PSU0                               | fault_state                                  | OK
    /SYS/PSU1                               | fault_state                                  | OK
    /SYS/FAN1                               | fault_state                                  | OK
    /SYS/FAN2                               | fault_state                               
       | OK
    /SYS/FAN3                               | fault_state                                  | OK

    -> show -d targets /SP/faultmgmt

     /SP/faultmgmt
        Targets:
            shell

    • Verify from the Infiniband Band BUI

    Conclusion

    In this article we have learned how to identify the fault and clear it manually on an Exadata Infiniband Switch. The ILOM commands comes handy for clearing the fault. You can also clear the fault using the Browser User Interface (BUI).