VMware ESXi HA agent failed

If you force problems with reconfigure HA on a ESXi server in a HA Cluster. A possible solution to resolve this is to uninstall the vpxa and the ha-agent. This can be done using the unsupported mode.

Steps:

* set ESXi in Maintenance Mode
* disconnnect ESXi from Virtual Center
* remove ESXi from Virtual Center
* connect with SSH to ESXi or direct using something like HP iLo
* execute the following commands

cd /opt/vmware/uninstallers/
./VMware-aam-ha-uninstall.sh
./VMware-vpxa-uninstall.sh

* add ESXi to Virtual Center
* exit ESXi from Maintenance Mode

* repeat this with all Nodes in the cluster…

OpenSource RedHat Satellite Server – Spacewalk

I found this software during my search for a management solution for a virtual appliance farm running hundreds of CentOS based virtual machines. As CentOS is binary compatible to RedHat it makes sense to use a RedHat satellite server equivalent for software management.

Spacewalk is an open source (GPLv2) Linux systems management solution. It is the upstream community project from which the Red Hat Network Satellite product is derived.

What Can Spacewalk Do?

Spacewalk is an open source (GPLv2) Linux systems management solution that allows you to:

* Inventory your systems (hardware and software information)
* Install and update software on your systems
* Collect and distribute your custom software packages into manageable groups
* Provision (kickstart) your systems
* Manage and deploy configuration files to your systems
* Monitor your systems
* Provision and start/stop/configure virtual guests
* Distribute content across multiple geographical sites in an efficient manner.

Link

Proxmox VE 1.6 released

A new version of the open source virtualization software Proxmox VE was released!

Change log:

* New 2.6.32 kernel including OpenVZ support, based on latest Debian Squeeze Kernel
* Removed KSM (OpenVZ does not support KSM)
* DRBD tools: update to drbd 8.3.7
* New KVM user-space tools (0.12.5)
* New OpenVZ user-space tools (vzctl 3.0.24)
* vzdump: use ionice to set io priority. By default, we now set “ionice -c2 -n7” to limit io (instead of using a default bwlimit)
* vzrestore: new –force option to overwrite existing data
* dab bug fix: only print a warning for packages installed directly with ‘dpkg’ (when there is no information in ‘available’ list)
* dab: only mount /proc/ if not already mounted (avoid problems with vzctl 3.0.24)
* Bug fixes

Download

long time no post …

I’m still busy @work. We are working on 3 new virtual environments. Therefore I have no time for my side but I will start to post new interessting things asap šŸ™‚ New Powershell code snippets, ideas about a control instance using Powershell, Apache and PHP for VMware and Bugs in VMware vSphere ……

Count VMs ToolsOK | ToolsOld | ToolsNotInstalled

Connect-VIServer -server VCSERVER -User USER -Password PASSWORD

$vms = Get-VM | Sort Name
$on = 0
$toolsOk = 0
$toolsOld = 0
$toolsNotInstalled = 0
foreach ($vm in $vms){
	if ($vm.PowerState -like "PoweredOn"){
		$vmview = $vm | Get-View
		$toolsstatus = $vmview.Guest.ToolsStatus
		$toolsrunningstatus = $vmview.Guest.ToolsRunningStatus
		$toolsversionstatus = $vmview.Guest.ToolsVersionStatus
		$on = $on+1
		if ($toolsstatus -like "toolsOk"){
			$toolsOk = $toolsOk + 1
		}
		if ($toolsstatus -like "toolsOld"){
			$toolsOld = $toolsOld + 1
		}
		if ($toolsstatus -like "toolsNotInstalled"){
			$toolsNotInstalled = $toolsNotInstalled + 1
		}
	}
}
$count = $vms.Count
$off = $count - $on
Write-Host "Total VMs: $count" 
Write-Host "Total powered on: $on"
Write-Host "Total powered off: $off"
Write-Host "ToolsOK: $toolsOk"
Write-Host "ToolsOld: $toolsOld"
Write-Host "ToolsNotInstalled: $toolsNotInstalled"

# DisConnect to Virtual Center
Write-Host "DisConnect from Virtual Center"
Disconnect-VIServer -Server VCSERVER -Confirm:$False

Find VMs with Video Card Memory higher than 30MB

Connect-VIServer -server VCENTER -User USER -Password PASSWORD

$vms = Get-VM
foreach ($vm in $vms) {
	$vmview = $vm | Get-View 
	foreach ($device in $vmview.config.hardware.device)
	{
		if ($device.Key -eq 500){
			$test = $device.DeviceInfo.Summary
			$result = $device.videoRamSizeInKB
			if ($result -cgt 30000){
				$mb = $result/1024
				Write-Host $vm $result $mb
			}
		}
	}
}
Write-Host "Done"

VMware ESXi 3.5 238493 PSOD on AMD Opteron 8356

We have a major incident in our vSphere Environment on 2 ESXi server running ESXi 3.5 Patch 238493. These servers are HP Dl585 G5 with 4x Quad Core AMD Opteron 8356 and 64 GB Memory.

Everytime VM’s getting migrated to this server using VMotion and DRS the server crashed. I have opened a Support Request on VMware site.

Steps to reproduce:

1. add server (HP Dl585 G5 AMD 8356) to cluster
2. exit maintenance mode
3. set another server in this cluster in maintenance mode
4. VM’s get live migrated to new server
5. after 3 – 14 VM’s the new server crash’s with PSOD
6. cancel maintenance mode on other server to save the environment

I checked our Hardware if there is a HW issue but on/in our Boxes everything is working perfectly.

Currently I test some workarounds to get our environment back stable.

1. Install server with ESXi …