Taneja Group | Cloudera
Join Newsletter
Forgot
password?
Register
Trusted Business Advisors, Expert Technology Analysts

Items Tagged: Cloudera

news / Blog

Enterprise IT Will Dive Into Big Data Solutions in 2013

If you are in IT, 2013 is going to be the year that you will want to dive into the "big data" pool if you haven't been pushed in already. But don't worry - it's no longer sink or swim. For one, we'll be here to help coach IT folks through it all. And while the concepts, terminology and hype have been all over the place, once you start floating around you'll find that under the surface much of what fills the big data pool is familiar IT infrastructure, data management, and services re-cast around a few easy-to-grasp innovations. For example, if you are in IT and asked to pick a Hadoop distro to stand up, you'd probably start with evaluating the three main distributions of Hadoop (other than getting it straight off Apache) followed by other downstream OEM'd and pre-integrated versions. The main distros are from Cloudera, Hortonworks, and MapR. I didn't really appreciate the differences until talking with all three individually (at 2012 NY Strata, see below).

  • Premiered: 01/15/13
  • Author: Mike Matchett
Topic(s): Big Data Hadoop Cloudera MapR Hortonworks Dell EMC strata Apache
news

Myths Surrounding Big Data Technology

Big data technology is a big deal for storage shops, and a clear understanding of what it means -- and doesn't mean -- is required to successfully configure storage for big data apps.

  • Premiered: 08/08/13
  • Author: Mike Matchett
  • Published: Tech Target: Search Storage
Topic(s): TBA Big Data TBA Storage TBA Cloudera TBA Apache TBA Hadoop TBA HDFS TBA MapR TBA NFS TBA CIFS TBA EMC TBA Isilon TBA DDN TBA DataDirect Networks TBA hScaler TBA Hortonworks
news / Blog

A Billion Here, A Billion There - Big Data Is Big Money

When we talk about big data today we aren't talking just about the data and its three V’s (or up to 15 depending on who you consult), but more and more about the promise of big transformation to the data center. In other words, it’s about big money. First, consider recent news about some key Hadoop distro vendors. Many of them are now billion dollar players, much of that on speculation and expectation of future data center occupation....

  • Premiered: 04/10/14
  • Author: Mike Matchett
Topic(s): Big Data Hadoop Pivotal Cloudera Hortonworks
news

Hadoop and Big Data Without Storage Headaches

Big Data appliances provide an easy way to implement storage for Hadoop.

  • Premiered: 06/23/15
  • Author: Taneja Group
  • Published: Datamation
Topic(s): TBA Hadoop TBA Big Data TBA Storage TBA Mike Matchett TBA Hadoop Distributed File System TBA HDFS TBA EMC TBA Isilon TBA HDS TBA Hitachi Data Systems TBA analytics TBA MapR TBA SAN TBA Cloudera TBA EMR TBA Amazon TBA HSP TBA hyper-converged
news

Big data analytics applications impact storage systems

Analytics applications for big data have placed extensive demands on storage systems, which Mike Matchett says often requires new or modified storage structures.

  • Premiered: 09/03/15
  • Author: Mike Matchett
  • Published: TechTarget: Search Storage
Topic(s): TBA Mike Matchett TBA Big Data TBA analytics TBA Storage TBA Primary Storage TBA scalability TBA Business Intelligence TBA BI TBA AWS TBA Amazon AWS TBA S3 TBA HPC TBA High Performance Computing TBA High Performance TBA ETL TBA HP Haven TBA HP TBA Hadoop TBA Vertica TBA convergence TBA converged TBA IOPS TBA Capacity TBA latency TBA scale-out TBA software-defined TBA software-defined storage TBA SDS TBA YARN TBA Spark
news

Mobile gaming company plays new Hadoop cluster management card

Chartboost, which operates a platform for mobile games, turned to new cluster management software in an effort to overcome problems in controlling the use of its Hadoop processing resources.

  • Premiered: 01/05/16
  • Author: Taneja Group
  • Published: TechTarget: Search Data Management
Topic(s): TBA Chartboost TBA mobile TBA cluster TBA Cluster Management TBA Hadoop TBA processing TBA data processing TBA analytics TBA Big Data TBA MapReduce TBA Hive TBA Spark TBA Optimization TBA Cloudera TBA AWS TBA Amazon TBA Cloud TBA YARN TBA Pepperdata TBA Memory TBA CPU TBA Application TBA Concurrent TBA SLA TBA service-level agreement TBA HBase TBA application performance TBA application performance management TBA Mike Matchett
news / Blog

Kudu Might Be Invasive: Cloudera Breaks Out Of HDFS

For the IT crowd just now getting to used to the idea of big data's HDFS (Hadoop's Distributed File System) and it's peculiarities, there is another alternative open source big data file system coming from Cloudera called Kudu. Like HDFS, Kudu is designed to be hosted across a scale-out cluster of commodity systems, but specifically intended to support more low-latency analytics. At it's heart, Kudu sits between the capabilities of HDFS and HBase to meet the growing use of interactive drill-down analytics (e.g. Impala) and the faster time-to-response Spark platform. It's a combination of on disk column store technology (for low latency queries) fronted by an in-memory write layer (for low latency updates/inserts), and fully distributed across the cluster....

  • Premiered: 01/11/16
  • Author: Mike Matchett
Topic(s): Big Data Cloudera Storage Kudu MapR Teradata
news / Blog

Big Data Enterprise Maturity

It's time to look at big data again. Last week I was at Cloudera's growing and vibrant annual analyst event to hear the latest from the folks who know what's what. Then this week Strata (conference for data scientists) brings lots of public big data vendor announcements. A noticeable shift this year is less focus on how to apply big data and more about maturing enterprise features intended to ease wider data center level adoption. A good example is the "mixed big data workload QoS" cluster optimizating solution from Pepperdata.

  • Premiered: 03/29/16
  • Author: Mike Matchett
Topic(s): Cloudera Pepperdata Big Data
news

Spark speeds up adoption of big data clusters and clouds

Infrastructure that supports big data comes from both the cloud and clusters. Enterprises can mix and match these seven infrastructure choices to meet their needs.

  • Premiered: 07/19/16
  • Author: Mike Matchett
  • Published: TechTarget: Search IT Operations
Topic(s): TBA Apache Spark TBA Spark TBA Mike Matchett TBA Cloud TBA cloud cluster TBA cluster TBA Big Data TBA big data analytics TBA MapReduce TBA Business Intelligence TBA BI TBA MLlib TBA High Performance TBA hadoop cluster TBA HDFS TBA Hadoop Distributed File System TBA IBM TBA Hortonworks TBA Cloudera TBA capacity management TBA Performance Management TBA API TBA SAN TBA storage area networks TBA CAPEX TBA DataDirect Networks TBA HPC TBA Lustre TBA Virtualization TBA VM
news

Apache Spark Survey Reveals Increased Growth in Users

In order to better understand Apache Spark’s growing role in big data, Taneja Group conducted a major market research project, surveying approximately 7,000 people.

  • Premiered: 11/08/16
  • Author: Taneja Group
  • Published: Satellite Press Releases
Topic(s): TBA Apache TBA Apache Hadoop TBA Apache Spark TBA Hadoop TBA Storage TBA Big Data TBA Data Management TBA Cloudera TBA In-Memory TBA Mike Matchett
news

Machine learning and data science workloads ignite Apache Spark adoption

The use of Apache Spark is dramatically increasing as new workloads create more use cases.

  • Premiered: 11/08/16
  • Author: Taneja Group
  • Published: CBR Online
Topic(s): TBA Apache TBA Apache Spark TBA Spark TBA Machine Learning TBA Big Data TBA Storage TBA Cloudera TBA Mike Matchett TBA analytics TBA Hadoop TBA Cloud TBA Public Cloud TBA Private Cloud TBA IBM TBA MapReduce
news / Blog

The New Big Thing in Big Data: Results From Our Apache Spark Survey

In the last few months I’ve been really bullish on Apache Spark as an big enabler of wider big data solution adoption. Recently we got the great opportunity to conduct some deep Spark market research (with Cloudera’s sponsorship) and were able to survey nearly seven thousand (6900+) highly qualified technical and managerial people working with big data from around the world. ... Some highlights -- First, across the broad range of industries, company sizes, and big data maturities, over one-half (54%) of respondents are already actively using Spark to solve a primary organizational use case. That’s an incredible adoption rate....

  • Premiered: 12/14/16
  • Author: Mike Matchett
Topic(s): Apache Spark Big Data Cloudera
news

With Apache Spark, Old Mainframes Learn New Tricks

Running Spark on the mainframe can be advantageous because data is co-located. One use is fraud detection.

  • Premiered: 12/26/16
  • Author: Taneja Group
  • Published: RT Insights
Topic(s): TBA Apache TBA Apache Spark TBA Spark TBA IBM TBA ETL TBA Cloudera TBA Big Data
news / Blog

Galactic Exchange Delivers the World's Easiest to Deploy Enterprise-Class Spark Clustering Tech

Galactic Exchange announced today that the cloud version of ClusterGX™, the world's most easy to deploy and use Enterprise-Class Spark/Hadoop clustering platform, is now available for deployment on AWS free of charge.

  • Premiered: 03/08/17
  • Author: Taneja Group
  • Published: Yahoo! Finance
Topic(s): Galactic Exchange Mike Matchett cluster Cloud AWS Amazon AWS Hybrid Cloud Virtualization Cloudera Hortonworks Amazon EBS HDFS Spark Hadoop Big Data
news

Impetus Technologies Announces StreamAnalytix 3.0 Feat. Support for Apache Spark-Based Batch Process

Impetus Technologies, a big data thought leader and software solutions company, today announced StreamAnalytix 3.0 featuring support for Apache Spark-based batch processing and enriched online and offline machine learning features, helping enterprises maximize the performance of their analytical models and achieve the most favorable business outcomes.

  • Premiered: 03/15/17
  • Author: Taneja Group
  • Published: Yahoo! Finance
Topic(s): TBA StreamAnalytix TBA Impetus Technologies TBA Mike Matchett TBA Apache Spark TBA Spark TBA Apache TBA Machine Learning TBA big data analytics TBA Big Data TBA Hadoop TBA Apache Hadoop TBA NoSQL TBA Apache Kafka TBA Apache Storm TBA Internet of Things TBA IoT TBA ETL TBA Cloudera TBA Hortonworks TBA MapR TBA Amazon AWS TBA AWS TBA Amazon TBA Amazon S3