Don’t Buy RAID cards for SSD Caching

“Where have you been Dog?”

The IT Dog here again, back from summer sabbatical. Well, it was more like an “election” sabbatical since I spent the last six months working on the campaign to elect Barkington T. Howl, III as President of United Dogs of America (UDA). “Bark,” as his frat brothers call him, turned out to be a little too high-brow and out of touch with the rank-and-file UDA constituency. He lost in a landslide. Anyway, I am now back at my day job, talking about SSDs in the IT marketplace. Did you miss me?

Back to Our Regularly Scheduled Blog Topic

So what is the first thing to write about since coming back? How about looking at a trend in the RAID marketplace – offering RAID controller cards sold with proprietary SSD caching software designed to boost performance over traditional RAID offerings.

SSD Caching RAID Controller Cards

Redundant Arrays of Independent Disks, or RAIDs as they are affectionately known, have been around for many years. They are a vital component in many IT installations, offering data redundancy and performance improvements over a standard disk array. With the introduction of SSDs and SSD caching, many RAID controller card manufacturers have updated their product offerings to include the ability to run SSD caching algorithms on the RAID controller card itself. Examples of this in the marketplace include LSI MegaRAID controller cards with CacheCade SSD caching software and Adaptec Series 7 controller cards with maxCache SSD caching software. The basic idea here is to buy the controller card from a particular vendor and use the SSD caching software they offer, which runs only on their controller card.

Show Me The Money

Ok, so I understand the idea. Let’s see if we can figure out if this is the right way to do SSD caching. I am going to talk about LSI’s solution, not because “LSI” is easier to type than “Adaptec”, but because there happen to be some independent test results published on the web by Demartek for LSI MegaRAID with CacheCade. I am going to try and decipher just what the results are telling me. You can click on the link to read the entire report which documents test set-up etc. I am just going to discuss one chart presented to see what the fuss is about.

Figure 1 below is from page 8 of the report. It shows throughput in megabits per second (Mbps) for a 90-minute web server test, both for the baseline system with no SSD caching and for the same system with SSD caching using one or two Intel X25-E 32GB SLC SSDs. The chart shows the baseline system without SSD caching maxed out at about 58 Mbps; with one SSD used for caching, performance improved to approximately 211 Mbps. Pretty nice. 3.6X improvement. And with two SSDs for caching, the throughput improved to 416 Mbps. 7.1X improvement! Excellent.

But let me dig into this a little more. The first thing I am trying to understand is how they could get only 58 Mbps out of the baseline system. Remember, this is MegaBITS per second, not MegaBYTES per second. I am not a RAID controller guru, but I would have expected the baseline performance to be greater than 7.25 Mbytes/s (58 Mbits/s divided by 8 bits per byte). A quick internet search on RAID performance led me to this ZDNet page, which listed RAID test performance ranging from 64 to 257 MegaBYTES/s. So the baseline figure for the LSI test is suspect, and a 3.6X or 7.1X improvement over really bad performance is not that impressive. But if we ignore the X-fold improvement and just look at the data, 211 Mbits/sec (that’s 26 Mbytes/sec to you and me) is nothing to get too excited about.
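For readers who want to check the unit arithmetic above, here is a minimal Python sketch. The inputs are the approximate values quoted from the chart; the function and variable names are made up for illustration and do not come from the Demartek report.

```python
# Sanity check of the LSI CacheCade figures quoted above.
# All throughput values are approximate readings in megabits per second.

BITS_PER_BYTE = 8


def mbit_to_mbyte(mbit_per_s):
    """Convert megabits per second to megabytes per second."""
    return mbit_per_s / BITS_PER_BYTE


def improvement(cached, baseline):
    """Return how many times faster the cached result is than the baseline."""
    return cached / baseline


# Hypothetical labels; values as quoted in the text (Mbit/s).
lsi_results_mbit = {
    "baseline (no SSD cache)": 58,
    "one SSD cache": 211,
    "two SSD caches": 416,
}

baseline_mbit = lsi_results_mbit["baseline (no SSD cache)"]

for label, mbit in lsi_results_mbit.items():
    print(
        f"{label}: {mbit} Mbit/s = {mbit_to_mbyte(mbit):.2f} MB/s, "
        f"{improvement(mbit, baseline_mbit):.1f}x baseline"
    )
```

With these rounded inputs the two-SSD ratio lands a shade above the quoted 7.1X, presumably because the report worked from unrounded measurements; that is my assumption, not something the report states.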

Figure 1. LSI MegaRAID with CacheCade Demartek Throughput. Source: http://www.demartek.com/Reports_Free/Demartek_LSI_CacheCade_Performance_...

VeloBit SSD Caching Software Results

So, I was trying to come up with an apples-to-apples comparison for the RAID SSD caching test data shown above, but unfortunately I don’t have data that exactly duplicates the LSI test. However, Demartek also tested VeloBit HyperCache SSD caching software under different conditions, and those results can be used to make some general comparisons.

Figure 2 below is taken from page 12 of the VeloBit Demartek report. The chart title is confusing: “Average MBPS – Linux without RD2”. Translated to English, the title means: average MegaBYTES/s of a Linux-based system running the vdbench workload test. The ‘without RD2’ part refers to a test called ‘RD2’ (mostly read operations) whose results (more than 3200 Mbytes/s) are so large that, in the first graph on page 12, they compress the other test results and make them difficult to read. So the RD2 results are removed from this graph (see the report) to zoom in on the RD1, RD3 and RD4 results. Well, that was a lot to explain for a chart title, sorry about that.

Anyway, several takeaways (don’t you just hate that word) from this data:
1. The baseline performance for each test ranged from approx. 20-35 Mbytes/s (Mbytes!). That is much higher than the baseline for the LSI tests (7.25 Mbytes/s).
2. The VeloBit HyperCache performance ranged from 200 to 275 Mbytes/s (that is 1600 - 2200 Mbits/s, to keep the same units as the LSI results).
3. These test results also include FlashSoft SSD caching software, because FlashSoft was available at Demartek for comparison purposes. By the way, FlashSoft also significantly outperforms the LSI CacheCade software.
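To put the two reports in the same units, the conversion in the list above can be sketched in a few lines of Python. The numbers are the approximate ranges quoted in this post, the names are hypothetical, and the tests ran under different conditions, so treat the resulting ratios as a rough indication and not a benchmark result.

```python
# Rough same-units comparison of the VeloBit and LSI figures quoted above.
# The two test setups differ, so the ratios are only indicative.

BITS_PER_BYTE = 8


def mbyte_to_mbit(mbyte_per_s):
    """Convert megabytes per second to megabits per second."""
    return mbyte_per_s * BITS_PER_BYTE


# Approximate ranges from the VeloBit report, in MB/s (low, high).
velobit_baseline_mbyte = (20, 35)
velobit_cached_mbyte = (200, 275)

# Best LSI CacheCade result quoted earlier (two SSDs), in Mbit/s.
lsi_best_mbit = 416

velobit_baseline_mbit = tuple(mbyte_to_mbit(v) for v in velobit_baseline_mbyte)
velobit_cached_mbit = tuple(mbyte_to_mbit(v) for v in velobit_cached_mbyte)

# How many times the best LSI figure each end of the cached range represents.
ratios_vs_lsi = tuple(v / lsi_best_mbit for v in velobit_cached_mbit)

print(f"VeloBit baseline: {velobit_baseline_mbit[0]}-{velobit_baseline_mbit[1]} Mbit/s")
print(f"VeloBit cached:   {velobit_cached_mbit[0]}-{velobit_cached_mbit[1]} Mbit/s")
print(f"Cached vs. best LSI result: {ratios_vs_lsi[0]:.1f}x to {ratios_vs_lsi[1]:.1f}x")
```

Keeping everything in megabits per second makes it harder to trip over the bits-versus-bytes difference between the two charts.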


Figure 2. VeloBit HyperCache SSD Caching Software Demartek Test Results. Source: http://www.velobit.com/Portals/106427/docs/demartek_velobit_ssd_caching_...

Conclusion

This was a long and winding road to try to make a simple point. RAID systems and SSD caching are two independent components within your IT system. If you have to go with a RAID solution for redundancy reasons, don’t get lured into the illusion that you can solve two problems with one product: you do not need to have your RAID solution also be your SSD caching solution. It may seem convenient to combine RAID and SSD caching, but if you try to solve two independent ‘problems’ with one product, you may not get the best performance for either. SSD caching enabled RAID controller cards:

  • tend to be more expensive than standard RAID controller cards
  • limit your SSD caching software solutions to software compatible with that card (vendor lock-in)
  • do not perform as well as other SSD caching options


The benefit of using RAID-based SSD caching is that the software runs on the RAID card and does not consume any server CPU/memory resources. However, that benefit doesn’t seem to be worth the cost and performance hit you take by using this solution.


More Stories By Peter Velikin

Peter Velikin has 12 years of experience creating new markets and commercializing products in multiple high tech industries. Prior to VeloBit, he was VP Marketing at Zmags, a SaaS-based digital content platform for e-commerce and mobile devices, where he managed all aspects of marketing, product management, and business development. Prior to that, Peter was Director of Product and Market Strategy at PTC, responsible for PTC’s publishing, content management, and services solutions. Prior to PTC, Peter was at EMC Corporation, where he held roles in product management, business development, and engineering program management.

Peter has an MS in Electrical Engineering from Boston University and an MBA from Harvard Business School.
