BlueArc logo

On the differences between Memory-Hierarchy and Intelligent-Tiering

Posted on February 27, 2010 at 7:32 pm | Tags: , , , ,

On the differences between Memory-Hierarchy and Intelligent-Tiering

What started as a quiet Saturday of catching up on “stuff” is turning into a wake-up call. Data indeed has value, and collecting information in my many RSS feeds folders, is not equivalent to reviewing them in a timely manner. Since there is so much relevant data that has accumulated, I need to devise a mechanism to tier this large amount of data by value, age, relevancy, urgency, importance, repetitive -which should be de-duplicated and so forth and so on.

I have missed a fun week. Tom Georgens’ statement during NetApp’s F3Q10 Earnings Call has generated a buzz. Having read several posts by my esteemed colleagues, I will add this “Weekly Edition” to the multiple threads.

Since I did not participate in NetApp’s Earnings Call, I can only base my understanding upon the call transcript.

…”frankly I think the concept of tiering is dying. And I probably don’t want to go into a long speech on that, but at the end of the day, the simple fact of the matter is tiering is a way to manage migration of data between fiber-channel based systems and serial ATA-based systems. With the advent of Flash, and we talked about our performance acceleration module, basically these systems are going to large amounts of Flash, which are going to be dynamic with serial ATA behind them, and the whole concept of HSM and tiered storage is going to go away…”

This statement by NetApp’s CEO combines Flash Technology, SSDs, SATA and FC drives with Tiering and HSM. Confusing media technology with use cases and fluidly of data does lead to wrong conclusions.

There is no argument concerning what tiers of memory will be used in future systems. In 2008 I gave a talk at a local Silicon Valley panel I organized about Flash Technology in the Enterprise. During the panel I hosted I did not use a crystal ball attempting to predict the future. The agenda centered on explaining current trends. Most of what I’ve said in 2008 is still valid.

…”With flash memory headed for its 3rd consecutive year of 60% per annum price decline, lots of activity has been generated around deployment of flash technology in the enterprise. Enterprise Storage deployment has taken several routes: SSDs (Solid State Drives), Traditional hierarchical memories (caches), I/O acceleration modules, etc. Most deployments focus on utilizing flash technology as a new tier of performance, since prices are an order of magnitude more expensive than HDD technology. Enterprise storage deployment has also highlighted several of the inherent limitation with current flash technology. While all system and storage vendors have already announced available solutions utilizing flash technology as “tier 0” storage, several startups have raised VC funds to design systems and controllers targeted at the enterprise storage market…”

Prices have stabilized, startups came out of stealth, MLC is gaining on SLC, controllers to increase robustness and mitigating (lack of) reliability issues emerged, no more FC HDDs will be sold after 2010, yet nothing really changed.

Ironically, since I limited the examples I’ve used during my talk to products that have already been announced, I used NetApp’s PAM (Mk I) which was already announced in 2008 to make two points: (a) explaining caching to masses and (b) explaining the lack of maturity of Flash, since PAM Mk I used DRAM technology, and explaining (Which is not the same as predictingJ), that PAM Mk II will use Flash Technology.

The example Tom Georgens used is about Memory-Hierarchy Design. Since slower memory is cheaper (I include Silicon Technology, Rotating Media and off-line archiving media all under “Memory”), and thus the corollary is that faster memory is smaller, all computer systems deploy hierarchical memory architectures. Since the cost of a storage system made of SRAMs will equal the national debt, and since the latency of accessing slow drives directly (in a many to many scenario), is larger than the time it will take the user to drink multiple cups of coffee waiting for the data to be retrieved, we all look for the optimal ratios and the relevant technology per each level in the hierarchy. Since the dawn of computer Systems all designers optimized for temporal and spatial localities of references, and designing such that hit rates will increase and miss penalties will decrease.

Indeed NetApp uses Flash Technology to create another level in the hierarchy. It is a good example of optimizing for size and cost. The disappearance of 15k RPM Fibre Channel drives is another valid example of one media Technology being replaced by another. Between Flash Technology and SATA drives, the economics for yet another level diminish. (Avoiding a crystal ball to predict future cost, currently, depending on the drive attributes, the cost per bit is between one and two order of magnitude more expensive between Flash and SATA.)

So, based on all of the above, why do all those frequently used and much appreciated storage tiers keep insisting that the rumors of their death have been greatly exaggerated?

Since the definition of storage tiers as defined by NetApp (“the simple fact of the matter is tiering is a way to manage migration of data between fiber-channel based systems and serial ATA-based systems”…), is wrong, or misleading, or both.

 

Most of the good arguments have already been made. Without any intention to ignore other posts, I did like the post and several of the comments posted on The Storage Architect.

I also like the post by my good friend Michael Hay.

I am also cognizant of being redundant with some of my points being the Johnny-come-lately to this debate.

 

Storage Tiering is not “simply a way to manage migration of data between fiber-channel based systems and serial ATA-based systems”. While the media tiers have, are and will change according to the economics of the media, the foundation of Intelligent Tiering will retain its validity.

 

Since no data ever gets erased, and since data has value associated with it, it is about transparent, automated, fluid movement of the data.

Tiering is definitely not dying. NetApp does not offer this functionality or capability so it makes sense that they would try to divert people’s attention away from tiering. I don’t think customers’ attention will be so easily diverted.

Let’s start with the data already in place. Customers have invested billions of dollars in Fibre Channel, SAS, and SATA disks that make up primary, secondary, and archive tiers.  They need intelligent, efficient tools to manage the movement of their data between these tiers. The data is the lifeblood of any organization and its transparent, automated, fluid movement is important.

 What is dead are the days of excess and seemingly unlimited budgets to just keep adding more high performance disk drives to satisfy the insatiable need for storage. For this reason customers have to be able to match the value of their data to the appropriate disk technology to meet access requirements and efficiently utilize current infrastructure to squeeze every drop of ROI from it. With fewer people to manage this infrastructure, automating the movement of data through an easy to use policy-based interface adds to ROI by making better use of storage administrators’ time. But here is the real key – making this movement transparent. Transparent, meaning the storage admin does not have to “reconnect” applications after data has been moved, nor do users have to be notified that their data now resides in a different location. In other words, the data has moved to a different tier, but the access path from the original point of storage is preserved, no matter how many times the data moves between tiers.

So the real discussion is not about tiering, but about transparency. The SAN vendors can do this at the block level and have been able to for years. But for the NAS vendors, doing this at the file level has only been achieved by BlueArc. We call it Intelligent Tiering and have been doing it since 2006.

Another key advantage of Tiering is the ability to extend our transparent, Intelligent Tiering to NAS systems other than our own. The ability to transparently migrate data to foreign file systems is what has enabled HDS and BlueArc to migrate from BlueArc to HCP or BlueArc to NetApp, with the major difference being that the data migration engine is intrinsic to the NAS device and not an external appliance. (This is better articulated by Michael Hay). We also provide software to address spikes in demand for data on secondary or archive tiers so that high performance access is assured no matter what type of disk drive the actual data is being read from. So is it any wonder that the biggest NAS vendor in the market would like everyone to think “tiering is dying”? They can’t come close to offering Intelligent Tiering, and if I were their customer I may start to think they are seriously out of touch with my needs.

Share this post:
  • Digg
  • del.icio.us
  • Facebook
  • Reddit
  • StumbleUpon
  • Technorati

BlueArc’s API Specifications for Intelligent Data Migration published under Open Source License

Posted on January 15, 2010 at 5:31 pm

In one of his previous blog entries, Michael Hay of Hitachi/HDS eloquently addressed file system virtualization and migration embedded into a NAS device. Quoting Michael: When we look at how HNAS can either tier internally or off to external systems using NFS this is a far more robust solution to the problem because there are fewer components.  Add in the Hitachi Data Discovery Suite and you get an advanced ILM offering that uses full content indexing”

Miki Sandorfi of HDS followed up in his blog entry about synergy, and I quote again: “This is precisely why we implemented policy-based intelligent tiering between our Hitachi NAS and Hitachi Content Platform”

This week, BlueArc has published the API specifications for Intelligent Data Migration under the Apache Open Source License.

BlueArc has long championed the concept of tiered storage, and has for many years supported the use of multiple storage technologies underneath SiliconFS.

BlueArc offers an intelligent data migration policy-based engine which allows storage administrators to implement their data movement policies. Data Migration is transparent to end-users and applications and does not require external ILM or data management devices. Because Data Migration is an embedded feature of SiliconFS, all filesystem functions (e.g., Snapshots, replication, quotas, etc.) work seamlessly as if the data were still on the original storage tier and data integrity is maintained during the migration or recall. As far as end-users and appli­cations are concerned the data has not moved at all.

This transparent and intelligent migration also supports seamless migration to external devices. This capability is defined as Cross-Volume Links. Cross-Volume Links are designed to operate either with internal BlueArc storage tiers or external, 3rd-party storage devices. It is the incorporation of external storage devices which greatly extends the reach of Data Migrator, and thus the entire BlueArc Transparent Data Migration strategy. Data can be migrated from tier to tier to tier, even to external tiers, and still be managed and presented to hosts and applications as a single cohesive whole. This is transparent, end-to-end data migration, a very powerful example of Transparent Data Mobility. The remote target file systems operate over NFS or HTTP protocols.

This embedded fluidity mechanism allows for integration with an ecosystem of partners, including content-archive storage tiers (Hitachi’s HCP), as discussed by Michael and Miki in their respective blogs.

Data migration can also be controlled via an API that has now been made available under the Apache open source license to any partner who plans to integrate into this ecosystem.

As more solution partners discover how to integrate work with BlueArc’s SiliconFS, more 3rd-party solutions will be incorporated into the data management framework, giving customers the capability of using both BlueArc and 3rd-party storage devices within a single, powerful, and transparent data migration strategy.

Share this post:
  • Digg
  • del.icio.us
  • Facebook
  • Reddit
  • StumbleUpon
  • Technorati

Celebrating (almost) a quarter century of satisfying CGI’s insatiable appetite for speed

Posted on January 6, 2010 at 7:57 pm | Tags: , , , , , , , , , , ,

Celebrating (almost) a quarter century of satisfying CGI’s insatiable appetite for speed

I am extremely proud to be part of the BlueArc team which supplied the NAS cluster on which Avatar was created.

Avatar is as the future of movies. A 3-D, high definition, computer animated movie, which was made possible with current technology.

The making of Avatar required years of CGI rendering. The rendering was done using over 4,000 compute blades (Top 200 on the top 500 supercomputer list). Weta Digital in Wellington, NZ built a top data center which enabled the creation of Avatar.

For throughput a 10 GB network connects 40,000 processor to a large amount of high speed storage.

A 12 node BlueArc Titan cluster served data at a rate of 8 GB/Sec to the compute grid. The final edited version of Avatar is over 2.8 TB of content.  The multiple raw scenes are in the 100’s of Terabytes.avatar

Avatar was released on December 17th, 2009. I am proud to have played my little role in supplying the high performance NAS gear needed to create such a movie.

On December 4th, 1985 a certain (forgotten) movie was released. It was also a movie that caused me to be proud of my team’s achievements. Like Avatar which required top gear and high performance equipment, the 24 year old movie relied on high performance computing.

That film, Young Sherlock Holmes, is notable for including the first fully computer-generated character, a knight composed of elements from a stained glass window. The effect was created by Lucasfilm’s John Lasseter (now Pixar Animation Studios). (The film was nominated for the 1985 Academy Award for Visual Effects.)

knight2The computer used to render the scene was CCI’s Power6/32 supermini. As a young engineer, I was part of the design team for the Power6/32 UNIX Supermini. The CCI Power6/32 Unix Supermini was selected by Lucasfilm as the CGI rendering platform for its speed.

BlueArc’s Titan and Mercury NAS system hold the top results for SpecSFS, which is the applicable benchmark.

In 1985, the applicable benchmarks were AT&T’s seven benchmarks, of which the B1 benchmark was the most important. The B1 benchmark was designed to strain to the limit a machine’s capacity to execute Calls and Returns. The machine performed 8 times faster than the top DEC VAX 11/780. (And was the first commercial port of UNIX System V).

The 30 seconds scene took 8 months to render on a single Power6/32 supermini. Compare it to the 40,000 processors rendering a 2 hour 45 minutes 3-D high def film ….WOW….

team1

And the Machine:

unix-world1

 

 

 

 

 

 

 

 

 

 What a magnificent way to celebrate a quarter century of satisfying the hunger for speed.

Humbly submitted by a proud member of the BlueArc team.

Share this post:
  • Digg
  • del.icio.us
  • Facebook
  • Reddit
  • StumbleUpon
  • Technorati

What surprises one finds far away from home

Posted on December 23, 2009 at 5:12 pm | Tags: , , ,

PlauqueWhat surprises one finds far away from home

Cochin, India is an amazing place! Chinese fishing nets dating back to Kublai Khan, A wonderful spice market inside old Portuguese buildings, a Portuguese church dating back to 1503, a synagogue dating back to 1568, the palace of the Maharajas and much more.

Ok, so you didn’t really expect me to use my blog as a platform to practice my travel stories. (But, for the history buff I am, Cochin was wonderful. I wish I was on Holiday…regrettably I just took off from business for an afternoon :-), the shirt loudly spells H-O-L-I-D-A-Y :-)).
I went to Cochin to deliver an invited talk at the HiPC conference. (www.hipc.org). The workshop I presented at was relevant to the file system development we perform here, at BlueArc, and was titled: Reconfigurable Computing. (http://www.ewh.ieee.org/conf/hprcw/program09.html).
My talk was titled: “Achieving High Performance in Network Attached Storage Systems through Novel Approach to FPGA Design and Implementation”. And depicted the implementation of the BlueArc file system. (And I did wear a completely different shirt :-)).

In attendance were several graduate students, who asked questions about file systems and FPGAs that showed a deep understanding of the subject matter. I did not remain puzzled for long.
Following the session I attended a poster session, at which many graduate students presented their research.
Venkatanathan Varadarajan and Sanath Kumar of the College of Engineering, Guindy,Anna University,
Chennai, presented a research paper about a method to accelerate a file system by implementing a directory search mechanism through FPGA based implementation.
Their full paper is accessible from: http://www.hipc.org/hipc2009/documents/HIPCSS09Papers/1569254101.pdf 

A Reconfigurable Hardware to Accelerate Directory Search Presentation

Suffice to say that we had a great discussion. The students were not cognizant of the BlueArc. They were ecstatic to learn about a complete implementation of a file system in FPGAs, and we will remain in touch.
There were additional very relevant papers presented about file systems and FPGA based implementations. I will not describe the other presentations.
Yet, this specific encounter was so unique, and has “made my day” (and theirs), so I wanted to share this experience.

Chinese Fishing Netsmy-rickshaw-for-the-day2

Share this post:
  • Digg
  • del.icio.us
  • Facebook
  • Reddit
  • StumbleUpon
  • Technorati

The most amazing scene factory in Wellington – My visit to see how Avatar was created

Posted on December 23, 2009 at 9:44 am | Tags: ,

The most amazing scene factory in Wellington – My visit to see how Avatar was created

What else can I write about Avatar that has not already been written in the mainstream media?
Success indeed has many fathers. Who does not want to be associated with the future of movies? I expect multiplicity of case studies, hundreds of blogs entries and lots of claims. Let me stake my claim to fame as the provider of the high performance Network Storage (NAS) on which Avatar was rendered. Many IT giants provided equipment to Weta Digital Data Center, we provided the “Ferraris” (to quote a Weta Digital person). When speed is everything, claiming top gear accolades is a source of pride indeed.
On December 10th, exactly one week before the release of Avatar, I followed the summer to Wellington.
I visited with Paul Ryan – Weta Digital’s CTO, Adam Shand who built this amazing data center, Matt Provost who manages the systems and their team mates for an interactive round table discussion focusing on Weta Digital’s challenges and ways for BlueArc to assist with future products and enhancements.
Following our roundtable technical brainstorming session (which will not get summarized in this blog entry  ), Adam, Paul and Matt took me for a tour of the data center. Having engineering genes, my DNA does not provide for being easily impressed. I was impressed with this data center! Weta Digital’s data center is an extremely dense data center, which ranks in the top 200 of the top 500 list of the world’s fastest supercomputers. With such compute power and density the tour started at the cooling facility. The cooling facility is a marvel of engineering achievement and eco friendliness.

The racks are water cooled. The heat exchange is fully enclosed. Hot air is sucked into a radiator and cycled through the front of the systems. The heat exchange is passive and because of the prevailing low temperatures in Wellington, all that is needed is to run water pumps to get rid of the heat. The heat exchanges through evaporation which occurs on top of the facility depicted in the enclosed photo.
Adam proudly pointed out that Weta Digital retained a local contractor to design the cooling infrastructure, a local engineering firm which usually designs dairies. Apparently both milk and water are liquids that one who has the experience knows how to handle. Hat’s off to Weta Digital for energy efficiency.
Another anecdote I’ve learned while touring the facility was that the raised floor is 1.2 meter high, which is tall enough to contain all the water below the computer gear in the event of a catastrophic leak.
As I initially pointed out I have no doubt that the rest of the impressive gear will be covered by case studies and blog entries by the respective vendors. (And I doubt I will do them justice), so I just focused on what I did find to be the most impressive eco friendly energy efficient data center.
Serving all the 4,000 server blades and the multiple of thousands of submitted jobs per night, over 10 Gigabit Ethernet, are the NAS storage racks from BlueArc.
The load on the storage system is enormous. BlueArc NAS systems handled over 8 GB/Sec, while maintaining 24X7 high availability. (Unlike some other HPC batch processes, CGI rendering is made up of multiple individual rendering tasks which cannot be check pointed, thus failure is unacceptable). Paul defines the data center as a “scenes factory”.
A minute of Avatar equates to 17.28 GB. Multiply by the length of the movie which if I remember correctly is 2 hours and 45 minutes.
Go watch the movie!

cooling

Share this post:
  • Digg
  • del.icio.us
  • Facebook
  • Reddit
  • StumbleUpon
  • Technorati

In Praise of Partnerships

Posted on September 13, 2009 at 6:08 pm

In Praise of Partnerships

My friend, Michael Hay, has recently posted a note titled “And all that NAS” on his blog.

http://blogs.hds.com/michael/2009/09/and-all-that-nas.html

This post was focused on the partnership HDS has established with BlueArc. Indeed this post got me to stop and realize the value of partnerships, which should not be taken for granted.

Innovations are successful upon solving a real customer problem. In his book, the Innovator’s Dilemma, Clayton Christensen makes the case for a “company within a company” to be responsible for a company’s response to disruptive technology.  While Christensen (who did serve on the board of directors of Snap, a NAS company in my previous life), defines disruptive technologies in his book as lower-end innovations, I will leverage the “company within a company” theme in his book. According to Christensen, incumbents generally do not react to disruptive innovations until it’s too late, because they do not represent an interesting market.

 This brings me to the voice of the customer theme. As Christensen highlights, sustaining technological changes are not the issue for industry leaders (Hitachi/HDS is definitely such a leader).  Industry leaders are good at sustaining their innovations. Beyond the technology focus, HDS (and Michael in particular) has allowed BlueArc to establish a large virtual customer council. When I chose the term “Voice of the Customer,” I had no intentions of limiting it to the classic six sigma VOC definition (which I know very little about…).  HDS serves a much larger and more diversified market than BlueArc, and as such allows BlueArc (and the inter-company NAS team) to add customer centric focus, deployment and solutions to the constant quest for more innovation.

Last but not least, this post is in Praise of Innovation;  Innovations resulting from what Michael so beautifully captured as “on the proverbial backs of napkins.”

I’ve kept some of the napkins, and will bring them with me to celebrate the 1,000th, 10,000th, 100,000th and beyond HNAS 3080 and 3090 system deployments.

Share this post:
  • Digg
  • del.icio.us
  • Facebook
  • Reddit
  • StumbleUpon
  • Technorati

BlueArc introduces Mercury

Posted on July 6, 2009 at 9:18 pm

BlueArc introduces Mercury.

Welcome to my “first” blog, more accurately, my first “official” blog on the BlueArc site. I have sporadically blogged when I felt like sharing some technical idea, but starting my “official” BlueArc blog will commit me to post periodically. Indeed a challenge when it seems like anything worth writing home about has already been written…J

Today, BlueArc introduced Mercury, its next generation NAS server. Mercury is BlueArc’s entry into the “mid-range” NAS market segment, without compromising any of the attributes BlueArc is known for: Scalability, Performance, and Richness of features and Robustness.

Mercury, like BlueArc’s Titan flagship family of products provides sustained, predictable and consistent performance under various loads. The Mercury family of NAS servers is equally efficient with a variety of I/O sizes, loads and access patterns.

Last week I took some time off and went sailing. (So, please bear with me since my imagination is lagging and I’m still under the influence). We chartered the boat in Lefkas and headed south towards Ithaca and the many other islands in the Ionic sea. The boat was a new 2009 Impression 514 by Élan. There are faster boats designed for speed sailing, we have sailed a variety of other yachts in the past. The Élan proved itself as a great bluewater cruiser, handling easily in the one day when we encountered a Beaufort 6 sea, and the days when the sea was flat. Consistently predictable.

Back to the world of NAS….(and sorry for being a bit esoteric…). Performance for a NAS device is defined by three measurable attributes: Throughput, Ops/Sec and Scalability. While performance has been a key design goal for Mercury, consistency of performance is one of Mercury’s key distinguishing attributes.  In the remainder of this blog I will highlight the key design attributes of the Mercury NAS server.

The Mercury system leverages the architecture of BlueArc’s Titan NAS and combines it with a multi-core platform to achieve the best of both worlds.

The Mercury architecture combines a multi-core platform with a NAS “off-load” engine. This allows for the widest applicability to changing workloads, data sets and access patterns

The following attributes define the implementation:

  • High Degree of parallelism

      Parallelism is key to performance. Many systems attempt to achieve such parallelism. While processors based implementations can provide some parallelism (provided the data has parallelism), as demonstrated by traditional MIMD architectures, (Cache coherent or message passing), such implementations require synchronization that limits scalability. BlueArc chose fine grain parallelism by implementing in state machines.

  • Off-loading

      Off-loading allows the core file system to independently process metadata and move data while the multi-core processor module is dedicated to data management. This is similar to traditional coprocessors (DSPs, Systolic Arrays, Graphics engines). This architecture provides yet another degree of parallelism.

  • Pipelining

      Pipelining is achieved when multiple instructions are simultaneously overlapped in execution. For a NAS system it means multiple file requests overlapping in execution.

So, why offload the file system functions?  The key reason for it was the need to achieve massive fine grain parallelism. Some applications indeed lend themselves well to achieving parallelism with multiplicity of cores, most do not. Since a NAS system will “park” on network and storage resources, any implementation that requires multiplicity of processors will create synchronization chatter larger than the advantage of adding processing elements beyond a very small number. Thus, the idea of offloading to a “co-processor” required the design from the ground up of an inherently parallelized and pipelined processing element by design. Choosing a state machine approach and leveraging this design methodology by implementing in FPGAs provided the massive parallelism for the file system, as the synchronization was “free”.

This is a true hybrid model where the appropriate resources are used for the relevant task.
Compilers will improve; the compiler needs to have the data placed such that the communication overhead is reduced and exploit larger parallelism. Future progress will hinge on adapting programming extensions for newly developed applications. As long as Moore’s law provided the easy way out by making the single processor faster, progress slowed. Necessity will accelerate the adaptation of new paradigms in parallelism.

Back to the hybrid model: The best implementations have indeed used offloading. BlueArc’s implementation offloads the file system execution to a special purpose designed offload engine, while leveraging the processing core for data applications.

Genoa (Jib) for speed, main (sail) to stabilize the boat, and when the sea is flat there’s always the diesel. Esoteric indeed and to be continued.

Share this post:
  • Digg
  • del.icio.us
  • Facebook
  • Reddit
  • StumbleUpon
  • Technorati