VMWare's VMmark

Before we take a look at our own virtualization benchmarking, let us look at the currently (end of August 2010) available VMmark scores.

VMWare VMmark

According to VMmark, the quad Xeon 7560 is about 25% faster than the quad Opteron 6174. VMmark gives a rough idea, but only a rough one. We already wrote down our doubts about VMmark, but here comes another one. The score of 75.34 is achieved with 300 VMs (50 tiles) and 512GB of RAM. That means that each physical Xeon 7500 core is shared by 9.4 VMs ! Now to be honest, this is not the real problem as quite a few servers out there have lots of virtual CPUs mapped onto one CPU. The problem is that VMmark scores use throughput, so the OEM benchmarking experts are completely focusing on throughput and not response times. The result of this is that you get very low (slow), possibly even unacceptable, performance per VM.

 

Let us make this clearer. If you look at the first pages of the VMmark result disclosure of the Dell R815 or Dell R910, you’ll see that the geometric mean score of one tile is around 1.5 (look at the number at the far right). To refresh your memory, a tile consist of 5 active and one idle workload:

  • MS Exchange (2 CPUs)
  • SpecJBB (Java Server, 2 CPUs)
  • Apache web server VM (2 CPUs)
  • MySQL database VM (2 CPUs)
  • SAMBA file server VM (1 CPU)
  • Idle VM

If one tile gets a score of 1.5, it means that it is 50% faster than the reference system which ran only one tile. However, the reference system was an old HP Proliant DL580 G2. This system contained two 2.2GHz single-core Intel Xeon CPUs with Hyper-Threading support, and had 16GB of memory. That is a 130nm Xeon “Galatin”, a CPU very similar to the Pentium 4 “Northwood” Desktop CPU. This is a pretty old Xeon: it was introduced in March 2004. Galatin had a 512KB L2 cache like “Northwood”, but a 2MB L3 cache was added to improve scalability, as this was a Xeon MP processor made for quad socket configurations.

Now Galatin was a pretty decent CPU when it came out, but this CPU was not made nor suitable for a virtualizated consolidation scenario. It had no hardware virtualization whatsoever, and the VM Exit and Entry overhead was no less than six times (and more!) worse than on the Xeon “Nehalem”. You can imagine that running five applications on two of those single core CPUs is not exactly a speedy experience. The file server achieved a "blazing 10MB/s" and the website (the e-commerce website of SPECweb2005) could keep up with about 17 hits per second. Now achieving 50% more than that with an ultra modern system will not please many users. Imagine the surprise of tens of users that have to share a 15MB/s stream while they connect via their 1 gigabit Ethernet ports to the spanking new “state-of-the-art” server that has 10 Gbit Ethernet available.

So the trouble with VMmark is that the highest scores are only a measure of the total throughput; the throughput of the individual applications however is pretty miserable. It is not just a server that is running at 100%, it is a server that is completely overutilized. So the benchmark favors throughput to the extreme, which may well exaggerate differences between the competing systems.

ERP: SAP S&D vApus Mark II
Comments Locked

51 Comments

View All Comments

  • jdavenport608 - Thursday, September 9, 2010 - link

    Appears that the pros and cons on the last page are not correct for the SGI server.
  • Photubias - Thursday, September 9, 2010 - link

    If you view the article in 'Print Format' than it shows correctly.
    Seems to be an Anandtech issue ... :p
  • Ryan Smith - Thursday, September 9, 2010 - link

    Fixed. Thanks for the notice.
  • yyrkoon - Friday, September 10, 2010 - link

    Hey guys, you've got to do better than this. The only thing that drew me to this article was the Name "SGI" and your explanation of their system is nothing.

    Why not just come out and say . . " Hey, look what I've got pictures of". Thats about all the use I have for the "article". Sorry if you do not like that Johan, but the truth hurts.
  • JohanAnandtech - Friday, September 10, 2010 - link

    It is clear that we do not focus on the typical SGI market. But you have noticed that from the other competitors and you know that HPC is not our main expertise, virtualization is. It is not really clear what your complaint is, so I assume that it is the lack of HPC benchmarks. Care to make your complaint a little more constructive?
  • davegraham - Monday, September 13, 2010 - link

    i'll defend Johan here...SGI has basically cornered themselves into the cloud scale market place where their BTO-style of engagement has really allowed them to prosper. If you wanted a competitive story there, the Dell DCS series of servers (C6100, for example) would be a better comparison.

    cheers,

    Dave
  • tech6 - Thursday, September 9, 2010 - link

    While the 815 is great value where the host is CPU bound, most VM workloads seem to be memory limited rather than processing power. Another consideration is server (in particularly memory) longevity which is something where the 810 inherits the 910s RAS features while the 815 misses out.

    I am not disagreeing with your conclusion that the 815 is great value but only if your workload is CPU bound and if you are willing to take the risk of not having RAS features in a data center application.
  • JFAMD - Thursday, September 9, 2010 - link

    True that there is a RAS difference, but you do have to weigh the budget differences and power differences to determine whether the RAS levels of either the R815 (or even a xeon 5600 system) are not sufficient for your application. Keep in mind that the xeon 7400 series did not have these RAS features, so if you were comfortable with the RAS levels of the 7400 series for these apps, then you have to question whether the new RAS features are a "must have". I am not saying that people shouldn't want more RAS (everyone should), but it is more a question of whether it is worth paying the extra price up front and the extra price every hour at the wall socket.

    For virtualization, the last time I talked to the VM vendors about attach rate, they said that their attach rate to platform matched the market (i.e. ~75% of their software was landing on 2P systems). So in the case of virtualization you can move to the R815 and still enjoy the economics of the 2P world but get the scalability of the 4P products.
  • tech6 - Thursday, September 9, 2010 - link

    I don't disagree but the RAS issue also dictates the longevity of the platform. I have been in the hosting business for a while and we see memory errors bring down 2 year+ old HP blades in alarming numbers. If you budget for a 4 year life cycle, then RAS has to be high on your list of features to make that happen.
  • mino - Thursday, September 9, 2010 - link

    Generally I would agree except that 2yr old HP blades (G5) are the worst way to ascertain commodity x86 platform reliability.
    Reasons:
    1) inadequate cooling setup (you better keep c7000 input air well below 20C at all costs)
    2) FBDIMM love to overheat
    3) G5 blade mobos are BIG MESS when it comes to memory compatibility => they clearly underestimated the tolerances needed

    4) All the points above hold true at least compared to HS21* and except 1) also against bl465*

    Speaking about 3yrs of operations of all three boxen in similar conditions. The most clear thi became to us when building power got cutoff and all our BladeSystems got dead within minutes (before running out of UPS by any means) while our 5yrs old BladeCenter (hosting all infrastructure services) remained online even at 35C (where the temp platoed thanks to dead HP's)
    Ironically, thanks to the dead production we did not have to kill infrastructure at all as the UPS's lasted for the 3 hours needed easily ...

Log in

Don't have an account? Sign up now