Solaris 10 Guest LDOM – vdisk@1 is offline

I was having a weird issue with one of my Solaris 10 guest domains. Every once in a while, the server will hang & all the processes provided by this guest domain will be inaccessible. There was no resource constraint at all in the server.

Output from /var/adm/messages
# cat messages.0
Apr 23 03:12:49 sshd[492]: [ID 800047 auth.error] error: accept: Software caused connection abort
Apr 23 03:12:49 inetd[341]: [ID 702911 daemon.error] accept: Software caused connection abort
Apr 23 03:12:51 last message repeated 2 times
Apr 24 15:45:09 vdc: [ID 990228 kern.info] vdisk@1 is offline
Apr 24 15:48:31 pseudo: [ID 129642 kern.info] pseudo-device: devinfo0
Apr 24 15:48:31 genunix: [ID 936769 kern.info] devinfo0 is /pseudo/devinfo@0
Apr 25 00:19:15 pseudo: [ID 129642 kern.info] pseudo-device: devinfo0
Apr 25 00:19:15 genunix: [ID 936769 kern.info] devinfo0 is /pseudo/devinfo@0
Apr 25 02:25:41 pseudo: [ID 129642 kern.info] pseudo-device: mdesc0
Apr 25 02:25:41 genunix: [ID 936769 kern.info] mdesc0 is /pseudo/mdesc@0
Apr 25 02:47:56 vdc: [ID 625787 kern.info] vdisk@1 is online using ldc@6,0
Apr 25 02:47:56 cnex: [ID 799930 kern.info] channel-device: vdc1
Apr 25 02:47:56 genunix: [ID 936769 kern.info] vdc1 is /virtual-devices@100/channel-devices@200/disk@0
Apr 26 03:04:17 sshd[492]: [ID 800047 auth.error] error: accept: Software caused connection abort
# cat messages
Apr 29 00:11:31 vdc: [ID 990228 kern.info] vdisk@1 is offline
Apr 29 00:14:04 pseudo: [ID 129642 kern.info] pseudo-device: devinfo0
Apr 29 00:14:04 genunix: [ID 936769 kern.info] devinfo0 is /pseudo/devinfo@0
Apr 29 02:25:19 pseudo: [ID 129642 kern.info] pseudo-device: mdesc0
Apr 29 02:25:19 [ID 936769 kern.info] mdesc0 is /pseudo/mdesc@0
Apr 29 02:47:33 vdc: [ID 625787 kern.info] vdisk@1 is online using ldc@6,0
Apr 29 02:47:33 cnex: [ID 799930 kern.info] channel-device: vdc1
Apr 29 02:47:33 genunix: [ID 936769 kern.info] vdc1 is /virtual-devices@100/channel-devices@200/disk@0
Apr 29 03:27:37 sshd[492]: [ID 800047 auth.error] error: accept: Software caused connection abort
Apr 29 03:27:38 inetd[341]: [ID 702911 daemon.error] accept: Software caused connection abort
Apr 29 03:27:39 last message repeated 1 time
Apr 30 00:10:53 vdc: [ID 990228 kern.info] vdisk@1 is offline

The solution from Oracle was as below. Apparently, there was some process in the guest domain that keeps unloading the vdc module. This will force Solaris kernel to load the module every time it boots.

  1. vi /etc/filesystem
    • forceload: drv/vdc
  2. reboot the guest domain

Hope you guys had a great Labour Day!

Advertisements
  1. Leave a comment

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: