Virtualization is at the center of all 21st Century IT systems, yet many CIOs fail to fully understand all of the benefits it can deliver to the data center operation. When we think of virtualization, we think compute, network, and storage—and we mostly think about driving up utilization on each. Since their introduction, storage controllers have offered the ability to carve out pieces of real storage from a large pool and deliver them efficiently to a number of hosts. Additional virtualization techniques and functions continue to offer improvements that further drive storage efficiency. IBM has been quietly addressing storage virtualization with SAN Volume Controller (SVC) for the last six years and has built up a significant technical lead in this space.
Overview - Storage in the Data Center
Strange but true: most infrastructure architectures are deliberately designed from the outset to need little or no change over their lifetimes. There are two main reasons for this:- Change often means outages and customer impact and must be avoided
- Budgets are set at the beginning of a project and getting more cash later is tough
Endemic Over-provisioning
The storage salesperson loves this approach. Always over-provisioning gets her more volume at a higher price than the much more sensible approach of buying what is really needed, just in time, would. In fact, many vendors’ storage sales strategies are aimed at just that issue, making it easy for the customer to over-provision. Actually, the work behind capacity and performance management is boring, tedious, and needs active participation from both the business unit and application development groups. Many organizations cut those IT service management jobs a few business cycles back as no one would notice immediately. The infrastructure architect knows this and builds in so much additional performance and capacity that it covers for the lack of attention to the capacity and performance management process. Here is a real life (and personal) example of what actually happens on the ground: An application has just been rolled out that, at launch, actually needs 20 GB of storage and enough performance to support 3 transactions per minute, peak. It is provisioned with 700 GB of storage on Tier 1 (320 GB, 15k Fibre Channel disks)—enough performance and capacity to deal with the best possible business case in 10 years. When the application got ported to a new hardware platform five years later, it was actually using 40 GB of storage at 60 transactions per minute, peak. To put some perspective around the commercial impact, over the five year hardware life, this level of over-provisioning consumed more than 6 megawatt hours of excess electricity and thousands of dollars in capital depreciation and maintenance costs. This example is repeated again and again across IT systems, making the cost of failing to do the job right, billions of dollars of waste, and many gigawatt hours of wasted energy an unnecessary data center plant. Even if some capacity and performance management had been performed that was able to identify in-life just how over-provisioned applications are, the difficulty of migrating data and applications to a lower tier of storage, plus the resultant service outage, would make the exercise impossible to justify.Operational Paralysis
In many high risk, high impact environments, such as wholesale banking, (where application outage costs can be greater that the cost of the hardware), provisioning totally separate infrastructure for each business unit is an entirely accepted practice. The thinking is that by separating the infrastructure, change risk can be mitigated and obtaining change approval or permission from the business to schedule an outage is easier. The problem here is not the change itself, but the impact of the change. Changes that reduce service availability are unpopular, difficult to manage, and sometimes significantly delay implementation. Sometimes, significant software and firmware patches can’t be installed; other times, unreliable hardware can’t be replaced—all because doing so would cause a service impacting outage. Well run IT infrastructure organizations set themselves up to avoid the need for changes that impact service availability. Unfortunately, changes to storage have been notoriously difficult to execute without impacting servers and applications.The Continuous Migration Architecture
In addition to improved utilization, virtualization offers the prospect of isolating the application from the underlying hardware to such an extent that application migration is viable in real time as part of normal business operations. At ESG, we refer to this as the “continuous migration architecture.” A continuous migration architecture delivers simultaneously on operational flexibility, business agility, and reduced costs. In effect, a CMA is the embodiment of the private cloud, including compute, storage, and network virtualization.What Value Does Virtualization Deliver?
Storage virtualization is all about operational efficiency, all about service levels and all about flexibility. Consolidation is nice, but it is only a small piece of the overall value. Storage virtualization, on the other hand, is NOT about disconnecting the physical from the physical—it is actually about breaking the link between the physical storage and the application. It is about being able to perform normal maintenance activities with no service impact—and being able to make very significant changes to the underlying physical environment without anyone noticing. Compute, network, and storage virtualization all deliver two key benefits.- Operational flexibility – the ability to separate the application from the underlying physical hardware
- High utilization of assets – sharing the resources between multiple consumers
IBM SAN Volume Controller (SVC)
IBM has delivered 15,000 SVC nodes to 5,000 customers and, as a result, has built up a huge amount of experience with most configurations and environments. IBM’s approach reflects the reality of today’s data center: the need to support multiple different hardware and software platforms is a consolidated and simplified way. The ability to work across multiple server virtualization hypervisors (VMware, Hyper-V, XEN, PowerVM, zLPAR) means that data center managers can fully support multiple virtualization environments (spanning Oracle Linux [XEN], Microsoft [HyperV], and VMware) all at the same time and from the same virtualized storage platform. The ability to work across multiple storage controllers—including EMC, Hitachi, HP, Pillar, IBM, Sun, Fujitsu, and others—enables incredible operational flexibility. Universal connectivity enables the introduction and retirement of storage systems in a flexible and non disruptive way. It also allows for flexible storage tiering, so when the workload increases on an application and the relevant disk component is unable to sustain the performance required, migration to a higher performance platform or tier can be managed transparently. A less important, but nevertheless relevant, issue is the ability to introduce competition into the storage buying process, because the issue of compatibility with what you already have is gone. Remote and local site mirroring between storage controllers of different types (e.g., Clariion to DMX, IBM to HP) means that cost effective disaster recovery platforms can be constructed from the best fit platforms.
The Bottom Line
Organizations that fail to adopt virtualization and the continuous migration architecture will find the costs of running their IT organizations grow faster than IT budgets will allow. The opportunity to make changes and react responsively to business demands will diminish as the cost of maintaining the legacy becomes all consuming and the cost of change becomes unaffordable. Businesses facing an unresponsive IT organization have typically responded in predictable ways, adopting the following approaches:- Do nothing and let more responsive businesses win
- Outsource IT
- Make a major IT investment once every five years to clean up the mess
- Get a new CIO/CTO
- Reduction of operational costs – reduced headcount
- Reduction of change risk
- Reduction of overprovisioning and overspending risks – reduced capital and operational costs
- Improved change cycle time, improved business agility
Printer-Friendly Version.
Please login to view a printer-friendly PDF version of this document. If you are not a member, please register. When you register, you will be able to view PDF versions of all our freely available documents, and rate and comment on site content.
For important information about using this content, please review our Terms & Conditions





