Items Tagged: Hortonworks
If you are in IT, 2013 is going to be the year that you will want to dive into the "big data" pool if you haven't been pushed in already. But don't worry - it's no longer sink or swim. For one, we'll be here to help coach IT folks through it all. And while the concepts, terminology and hype have been all over the place, once you start floating around you'll find that under the surface much of what fills the big data pool is familiar IT infrastructure, data management, and services re-cast around a few easy-to-grasp innovations. For example, if you are in IT and are asked to pick a Hadoop distro to stand up, you'd probably start by evaluating the three main distributions of Hadoop (other than getting it straight off Apache), followed by other downstream OEM'd and pre-integrated versions. The main distros are from Cloudera, Hortonworks, and MapR. I didn't really appreciate the differences until talking with all three individually (at 2012 NY Strata, see below).
Big data technology is a big deal for storage shops, and a clear understanding of what it means -- and doesn't mean -- is required to successfully configure storage for big data apps.
When we talk about big data today we aren't just talking about the data and its three V’s (or up to 15 depending on who you consult), but more and more about the promise of big transformation to the data center. In other words, it’s about big money. First, consider recent news about some key Hadoop distro vendors. Many of them are now billion-dollar players, much of that on speculation and expectation of future data center occupation....
At this month's Hadoop Summit 2015 I noted two big trends. One was the continuing focus on Spark as an expansion of the big data analytical ecosystem, with main sponsor Hortonworks and most vendors talking about how they support, interact with, or deliver Spark in addition to Hadoop's MapReduce. The other was a very noticeable shift in focus away from trotting out ever more gee-whiz big data use cases and towards talking about how to make it all work in enterprise production environments. If you ask me, this second trend is the bigger deal for IT folks to pay attention to.
Analytics applications for big data have placed extensive demands on storage systems, which Mike Matchett says often require new or modified storage structures.
- Premiered: 09/03/15
- Author: Mike Matchett
- Published: TechTarget: Search Storage
Infrastructure that supports big data comes from both the cloud and clusters. Enterprises can mix and match these seven infrastructure choices to meet their needs.
- Premiered: 07/19/16
- Author: Mike Matchett
- Published: TechTarget: Search IT Operations
Galactic Exchange announced today that the cloud version of ClusterGX™, the world's easiest to deploy and use Enterprise-Class Spark/Hadoop clustering platform, is now available for deployment on AWS free of charge.
Impetus Technologies Announces StreamAnalytix 3.0 Feat. Support for Apache Spark-Based Batch Process
Impetus Technologies, a big data thought leader and software solutions company, today announced StreamAnalytix 3.0 featuring support for Apache Spark-based batch processing and enriched online and offline machine learning features, helping enterprises maximize the performance of their analytical models and achieve the most favorable business outcomes.
- Premiered: 03/15/17
- Author: Taneja Group
- Published: Yahoo! Finance