Storage Tutorial: Live from Spark Summit East with Continuum

This Storage Tutorial was filmed live at Spark Summit East.


Our host, Brian Chang, is joined by Peter Wang, president of Continuum, along with show regulars Irshad Raihan and Greg Kleiman of Red Hat Big Data. Peter fills the group in about what buzz he is hearing at the conference as well as what sorts of big data use cases he’s seeing best supported on Spark. Read on for an excerpt of the conversation, but check out the video for the full discussion.

Continue reading “Storage Tutorial: Live from Spark Summit East with Continuum”

Storage & Big Data Tutorial – Live at Strata+Hadoop World with Greg Kleiman

Strata+Hadoop World 2015 is underway in San Jose, CA and Red Hat Storage is on the scene. Today, Brian Chang chats with Greg Kleiman, director of storage and big data at Red Hat, to learn what news Greg is expecting to hear at the show and also what Red Hat is up to. Watch the video or read on for a summary.

Continue reading “Storage & Big Data Tutorial – Live at Strata+Hadoop World with Greg Kleiman”

Big data in risk management: Red Hat and Hortonworks to the rescue

The real estate bubble of 2008 made shockwaves worldwide. This was so significant an event that the US economy only this year was determined to have recovered to pre-bubble levels. But this event also resulted in all sorts of other shockwaves — regulatory ones.

To address these regulatory needs for Wall Street, Red Hat Storage teamed up with Hortonworks to build an enterprise grade big data risk management solution. At the recent Strata+Hadoop World, Vamsi Chemitiganti (chief architect, financial services) presented the solution in a session — which you can see for yourself at the bottom of this post.

Continue reading “Big data in risk management: Red Hat and Hortonworks to the rescue”

Can Big Data Swing Elections?

Big Data is everywhere. We were recently at the Strata Hadoop World conference and were blown away by some of the use cases in industries you wouldn’t normally categorize as being data driven. More and more leaders are seeking out data scientists to massage, splice, and contort data to spill out useful, actionable insights they can use as a competitive advantage.

In the last few years, with the access to bigger and richer data sets, we’ve moved from an era where data itself was a competitive advantage to the point where data analytics is the new battleground. What you can do with the data matters much more that how much data you have in the data center.

Continue reading “Can Big Data Swing Elections?”

What Can a Paper Shredder Teach Us About Big Data?

by Irshad Raihan, Red Hat Storage – Big Data Product Marketing

The trusty paper shredder in my home office died last week. I’m in the market for a new one. Years ago, when I purchased “Shreddy” (of course, it had a name) after a brief conversation with a random store clerk, choices were few and information scarce. In fact, paper shredders weren’t really considered standard personal office equipment as they are today. Most good shredders were built for offices not homes. Back in the market more than a decade later, it’s clear that the search for a new shredder is going to be trickier than I had imagined.

A paper shredder is a lot like big data.

Continue reading “What Can a Paper Shredder Teach Us About Big Data?”

Linux, Gluster and the FRANKENSTEIN MONSTER!!!

I_Frankenstein_Poster(WARNING: Video contains graphic, horrific fight sequences, which contain violence and gore; parental guidance suggested)

What do open source software solutions like the Linux operating system and the Gluster distributed file system have in common w/ the Frankenstein Monster? Don’t know?!? Arrrgghhh!!!

Continue reading “Linux, Gluster and the FRANKENSTEIN MONSTER!!!”

The Data Life Cycle Has Changed. Are You Ready?

by Irshad Raihan, Red Hat Storage – Big Data Product Marketing

Digital data has been around for centuries in one form or the other. Commercial tabulating machines have been available since the late 1800’s when they were used for accounting, inventory and population census. Why then do we label today as the Big Data age? What dramatically changed in the last 10-15 years that has the entire IT industry chomping at the bit?

More data? Certainly. But that’s only the tip of the iceberg. There are two big drivers that have contributed to the classic V’s (Volume, Variety, Velocity) of Big Data. The first is the commoditization of computing hardware – servers, storage, sensors, cell phones – basically anything that runs on silicon. The second is the explosion in the number of data authors – both machines and humans.

Continue reading “The Data Life Cycle Has Changed. Are You Ready?”

Red Hat at 2014 Hadoop Summit this week (June 3-5)

The Big Data community is converging on San Jose, CA this week (June 3-5) for 2014 Hadoop Summit. If you plan to attend, please be sure to stop by the Red Hat booth (#G18) to learn more about Red Hat’s Big Data solution. More details are available here:

And to sign up for 2014 Hadoop Summit, register here:

See you at the conference!

IDC Study in Enterprise Hadoop Deployments


IDC Study Finds Customers Use More Than Hadoop to Holistically Analyze Data

IDC has taken a look at the ways businesses are using Hadoop in conjunction with traditional analytics to draw insight out of their data. The IDC White Paper, sponsored by Red Hat, entitled “Trends in Enterprise Hadoop Deployments” (October 2013) takes a look at what percentage of businesses have existing Hadoop deployments, immediate plans to deploy, and long-term plans to deploy. The white paper also investigated the ways in which businesses use Hadoop to analyze big data. What becomes clear is that businesses use Hadoop in a variety of ways and in concert with other platforms. The outcome of this is that some enterprises are looking to alternative persistent storage systems that go beyond HDFS. Red Hat Storage offers GlusterFS as an alternative to HDFS and ranks with IBM’s Global File system (GPFS), and EMC Isilon OneFS as the top offerings, thanks to its strong reputation for being robust, scale-out and open source.

Continue reading “IDC Study in Enterprise Hadoop Deployments”

A Crucial Piece of the Hadoop Ecosystem: Scale-out Storage

By now you may have heard our announcement that GlusterFS 3.3 will be able to serve as a scale-out data store for Apache Hadoop. This is one area where a solution has been sorely needed, and we’re happy to provide one. Here’s the relevant piece from the press release:

Compatible with existing MapReduce-based applications and with the ability to co-exist with HDFS, this new functionality opens up data within Hadoop deployments to any file- or object-based application, providing a broad new range of options for big data deployments.

Some of the benefits for Hadoop users include:

  • Fast access to data – with no centralized metadata or name server, GlusterFS delivers better performance and easier manageability, with no centralized point of failure;
  • N-way replication – synchronous and asynchronous replication enables users to replicate data, changes are tracked and queued to ensure data stays synchronized regardless of latency or potential network interruptions;
  • High-availability – GlusterFS includes built-in failover, self-healing and no single point of failure;
  • Increased flexibility in sizing – with Gluster, Hadoop deployments are no longer limited to large block sizes or 16 TB volume sizes;
  • POSIX-compliant NAS (NFS, CIFS) and object access to files within Hadoop deployments – with Gluster, big data deployments can now enable operating systems to directly mount the storage, making it much easier to get data into and out of the system;
  • Easy access – to both file and object storage which simplifies management and controls storage costs; and,
  • Open source – available under a GPLv3 open source license.

Want to learn more?

Gluster will host a meetup on the topic of GlusterFS 3.3 at Gluster HQ in Sunnyvale on August 30, beginning at 6:00 pm PDT. Topics will include storage for Hadoop, as well as Unified File and Object storage.

We will also host a webinar, “What’s new in GlusterFS 3.3” on Tuesday, August 30 at 2 p.m. ET. To register for the event visit – this is a good way to interact with Gluster experts and learn more about our recent announcements.

The Hadoop GlusterFS 3.3 beta 2 is available now. To participate in the public beta simply visit our 3.3 beta resource page: To sign up as an official beta site, contact or