Deploying PySpark on Red Hat Storage GlusterFS

by Steve Watt, Chief Architect, Big Data, Red Hat

Red Hat and Continuum Analytics are pleased to announce a new solution that lets customers deploy PySpark on top of Red Hat Storage GlusterFS. If you’re attending Strata, swing by the Red Hat booth to grab a solution brief that describes how the solution is put together and how to set it up. For those of you who are not at Strata, here’s the overview, and be sure to check out the technology brief as well.


IDC Study in Enterprise Hadoop Deployments


IDC Study Finds Customers Use More Than Hadoop to Holistically Analyze Data

IDC has examined how businesses use Hadoop in conjunction with traditional analytics to draw insight out of their data. The IDC White Paper, sponsored by Red Hat and entitled “Trends in Enterprise Hadoop Deployments” (October 2013), reports what percentage of businesses have existing Hadoop deployments, immediate plans to deploy, and long-term plans to deploy. The white paper also investigates the ways in which businesses use Hadoop to analyze big data. What becomes clear is that businesses use Hadoop in a variety of ways and in concert with other platforms. As a result, some enterprises are looking to alternative persistent storage systems that go beyond HDFS. Red Hat Storage offers GlusterFS as an alternative to HDFS, and it ranks with IBM’s General Parallel File System (GPFS) and EMC Isilon OneFS as one of the top offerings, thanks to its strong reputation for being robust, scale-out, and open source.
