Multipath ...

Issue: Removing a device, or replacing it with a different SAN LUN, can be a bit challenging.

HISTORY: Take this completely hypothetical situation, which may or may not happen to me quite frequently...
Your customer asks for 3 x 64 Gig and 1 x 16 Gig LUNs, and for some reason the SAN admin assigns 4 x 64 Gig LUNs. You don't know this until you scan the bus and see them from the OS. At that point you have to tell the SAN admin that one of the 64 Gig LUNs needs to be replaced with a 16 Gig. You also ask the admin to let you know when he/she removes the incorrect LUN, so you can run your procedure to remove the device from the OS view before he/she adds the 16 Gig.

Inevitably, you end up with syslog complaining about a SAN device no longer being available, and the fun begins. (This is because the admin probably removed the 64 and added the 16 in the same keystroke or click of a button, which gives you no opportunity to straighten things out.) The only thing that makes this worse is when it happens on a 3-node RAC cluster and you have to clean things up on 3 boxes ;-)

# tail -f /var/log/messages
# multipath -ll > /tmp/mpath.out
-- check that output for 'fail'; the failed paths should still be listed with their "sd" device names
-- confirm the failed device has the WWN you were expecting
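
Checking the saved output for failed paths can be scripted. The following is only a minimal sketch and not part of the original procedure: list_failed_paths is a hypothetical helper name, and it assumes each path line in the multipath -ll output contains an H:C:T:L address followed by an sd device name. That is the usual layout, but it may differ between multipath-tools versions.

```shell
#!/bin/sh
# list_failed_paths FILE   (hypothetical helper, not a multipath-tools command)
# Print "H:C:T:L sdX" for every line of saved `multipath -ll` output that
# mentions "fail". Assumes path lines look like "... 2:0:0:44 sdb 8:16 ...".
list_failed_paths() {
    grep -i 'fail' "$1" | grep -oE '[0-9]+:[0-9]+:[0-9]+:[0-9]+ +sd[a-z]+'
}
```

Running list_failed_paths /tmp/mpath.out should then give the H:C:T:L addresses to feed into the offline and delete steps.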

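-- 2:0:0:44 and 3:0:0:44 are the two paths (host:channel:target:LUN) to the failed LUN in this example
# echo offline > /sys/class/scsi_disk/2:0:0:44/device/state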
# echo offline > /sys/class/scsi_disk/3:0:0:44/device/state
# echo 1 > /sys/class/scsi_disk/2:0:0:44/device/delete
# echo 1 > /sys/class/scsi_disk/3:0:0:44/device/delete
# echo "- - -" > /sys/class/scsi_host/host#/scan
-- replace host# with each HBA's host number (host2 and host3 in this example)
# rescan-scsi-bus.sh
-- the rescan shell script is part of the sg3_utils package
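
The offline, delete, and rescan steps above can be wrapped in a couple of small functions, which helps when the same cleanup has to be repeated on every node. This is a rough sketch under stated assumptions, not a tested tool: remove_scsi_path and rescan_all_hosts are hypothetical names, and SYSFS_ROOT is a hypothetical override (defaulting to /sys) that exists only so the functions can be tried against a scratch directory first.

```shell
#!/bin/sh
# SYSFS_ROOT is a hypothetical override for dry runs; it defaults to the real /sys.
SYSFS_ROOT="${SYSFS_ROOT:-/sys}"

# remove_scsi_path H:C:T:L   (hypothetical helper)
# Mark one SCSI path offline, then delete it from the OS view.
remove_scsi_path() {
    dev="$SYSFS_ROOT/class/scsi_disk/$1/device"
    if [ ! -e "$dev/state" ]; then
        echo "no such SCSI device: $1" >&2
        return 1
    fi
    echo offline > "$dev/state" || return 1
    echo 1 > "$dev/delete"
}

# rescan_all_hosts   (hypothetical helper)
# Write the "- - -" wildcard (all channels, targets, LUNs) to every host's scan file.
rescan_all_hosts() {
    for scan in "$SYSFS_ROOT"/class/scsi_host/host*/scan; do
        [ -e "$scan" ] || continue
        echo "- - -" > "$scan"
    done
}
```

For the example above, that would be remove_scsi_path 2:0:0:44, then remove_scsi_path 3:0:0:44, then rescan_all_hosts, run as root on each node.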

I believe the root of the problem is that the replacement LUN occupies the same device path as the LUN it replaced, although I have not devoted much time to proving this yet...

#!/bin/sh
# return all offline scsi devices to the running state
for d in /sys/block/sd*/device/state; do
    # quote the path and the value so an empty or odd state cannot break the test
    if [ "$(cat "$d")" = "offline" ]; then echo running > "$d"; fi
done

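Enable additional logging in the lpfc driver:
# echo 0x1f > /sys/module/lpfc/lpfc_log_verbose
-- on newer kernels this is a per-HBA attribute: /sys/class/scsi_host/host#/lpfc_log_verbose

Enable extra logging for the scsi-subsystem:
# echo 7 > /sys/module/scsi_mod/scsi_logging_level
-- on newer kernels the parameter lives at /sys/module/scsi_mod/parameters/scsi_logging_level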