Eclipse Authors: Yeshim Deniz, Liz McMillan, Elizabeth White, XebiaLabs Blog, Ken Fogel

News Feed Item

Elasticsearch Now Certified on Cloudera Enterprise 5; Releases New Hadoop Connector

Elasticsearch Unlocks Potential for Businesses to Get Immediate Insights out of Data They Store in Hadoop

LOS ALTOS, CA and AMSTERDAM, THE NETHERLANDS -- (Marketwired) -- 06/19/14 -- Elasticsearch, Inc., the company on a mission to make data useful to businesses by delivering the world's most advanced search and analytics engine, today announced the 2.0 release of its Hadoop connector, Elasticsearch for Apache Hadoop, along with certification on Cloudera Enterprise 5. With Cloudera certification, Elasticsearch is now compatible across all Apache-based Hadoop distributions, including HortonWorks and MapR, helping businesses extract immediate insights regardless of where their hundreds of terabytes or even petabytes of data are stored.

Elasticsearch is the search and analytics engine behind the ELK stack, which also utilizes Logstash, a log management tool, and Kibana's powerful data visualization capabilities to help businesses pull vital information from their data stores. When used in conjunction with Hadoop, organizations no longer need to run a batch process and wait hours to analyze their data -- Elasticsearch for Apache Hadoop can pipe data to Elasticsearch for indexing as it's being generated, making it available for search and analysis in a matter of seconds. Kibana can also be used to explore massive amounts of data in Elasticsearch through easy-to-generate pie charts, bar graphs, scatter plots, histograms, and more.

How Businesses Leverage Elasticsearch and Hadoop
Elasticsearch is becoming the critical piece of pulling data from any environment and getting it into the hands of developers, engineering leads, CTOs, and CIOs who need insight into moving parts of their business at the rate they are happening. Customer examples include:

  • Klout, which stores petabytes of its 400 million+ users' data in a Hadoop Distributed File System and connects it to Elasticsearch. Klout query results, used to build targeted marketing campaigns, are delivered in seconds rather than minutes.
  • MutualMind, which enables customers like AT&T, Kraft, Nestle, and Starbucks to monitor their brands on social networks. After its Hadoop batches started taking 15+ minutes, MutualMind moved to Elasticsearch to power its real-time analytics, while utilizing Hadoop for statistical analysis.
  • An international financial services firm that uses Elasticsearch to analyze its access logs in just minutes instead of having to wait hours to run MapReduce jobs. Because Elasticsearch provided insights so quickly on the firm's large amounts of data, they've been able to increase the window of data they can analyze from one hour to a full week.

Key Features of Elasticsearch for Apache Hadoop

  • The ability to read and write data between Hadoop and Elasticsearch: Lets businesses get immediate, actionable insights by writing their data to Elasticsearch for real-time search and analysis. Complex jobs that would normally take minutes or hours to run in Hadoop can be handled quickly in Elasticsearch and read right back to Hadoop.
  • Native integration and support for popular Hadoop libraries: Lets users run queries natively on Hadoop through MapReduce, Hive, Pig, or Cascading APIs.
  • Snapshot/Restore: Makes it easy to take a snapshot of data within Elasticsearch -- perhaps a year's worth -- and archive it in Hadoop. At any time, the snapshot can be restored back to Elasticsearch for additional analysis.

Supporting Quotes from Cloudera, Elasticsearch, and Klout

Steven Schuurman, co-founder and CEO, Elasticsearch
"Hadoop was created to store and archive data at a massive scale, but businesses need to be able to ask, iterate, and extract actionable insights from this data -- which is what we designed our products for. With today's certification from Cloudera, Elasticsearch now works with all Apache-based Hadoop distributions, and with it, solves the last mile of big data Hadoop deployments by getting big insights, fast."

Tim Stevens, vice president of Business and Corporate Development at Cloudera
"Part of our mission at Cloudera is to support and promote an open architecture and allow customers to leverage their technology investments. Together, Cloudera and Elasticsearch provide businesses with a solution that allows them to get insight out of massive amounts of data."

Felipe Oliveria, director of Engineering, Backend for Klout
"Elasticsearch has a very good integration with Hadoop. It allows us to export a Hive table to an index on Elasticsearch very easily. HBase is a great data store, and it allows random access to the data, which Elasticsearch is perfect for. Elasticsearch fits very nicely into our data pipeline."

Because Elasticsearch works across distributed, diverse environments, engineers can search, extract, clean up and analyze data whether it comes from log events, social media activity, support tickets, website analytics or product interactions. Thousands of businesses worldwide continue to adopt Elasticsearch to store, search and analyze any type of data in real time, including Bloomberg, Comcast, eBay, Facebook, GitHub, Mayo Clinic, McGraw-Hill, Netflix, The New York Times, Target, Verizon, WordPress and Yelp.

To download Elasticsearch for Apache Hadoop, visit http://www.elasticsearch.org/overview/hadoop/. To find out more about Elasticsearch, visit www.elasticsearch.com.

Upcoming webinar: Real-time Analytics and Anomaly Detection using Elasticsearch and Apache Hadoop
On Wednesday, August 20, 2014 at 9:00am PT/12:00pm ET, Elasticsearch will host a webinar that goes over the features and benefits of Elasticsearch for Apache Hadoop, including a demonstration of how to use it as a platform to perform search and analytics, such as anomaly detection. To register, visit http://www.elasticsearch.org/webinars/elasticsearch-and-apache-hadoop.

About Elasticsearch, Inc.
Elasticsearch is on a mission to make massive amounts of data usable for businesses everywhere by delivering the world's most advanced search and analytics engine. With a laser focus on achieving the best user experience imaginable, the Elasticsearch ELK stack -- comprised of Elasticsearch, Logstash and Kibana -- has become one of the most popular and rapidly growing open source solutions in the market. Used by thousands of enterprises in virtually every industry today, Elasticsearch, Inc. provides production support, development support and training for the full ELK stack.

Elasticsearch, Inc. was founded in 2012 by the people behind the Elasticsearch and Apache Lucene open source projects. Since its initial release, Elasticsearch has more than 9 million cumulative downloads. Elasticsearch, Inc. is backed by Benchmark Capital, Index Ventures and NEA, with headquarters in Amsterdam and Los Altos, California, and offices around the world.

More Stories By Marketwired .

Copyright © 2009 Marketwired. All rights reserved. All the news releases provided by Marketwired are copyrighted. Any forms of copying other than an individual user's personal reference without express written permission is prohibited. Further distribution of these materials is strictly forbidden, including but not limited to, posting, emailing, faxing, archiving in a public database, redistributing via a computer network or in a printed form.

@ThingsExpo Stories
SYS-CON Events announced today that Niagara Networks will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Niagara Networks offers the highest port-density systems, and the most complete Next-Generation Network Visibility systems including Network Packet Brokers, Bypass Switches, and Network TAPs.
SYS-CON Events announced today that Transparent Cloud Computing (T-Cloud) Consortium will exhibit at the 19th International Cloud Expo®, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. The Transparent Cloud Computing Consortium (T-Cloud Consortium) will conduct research activities into changes in the computing model as a result of collaboration between "device" and "cloud" and the creation of new value and markets through organic data proces...
Everyone knows that truly innovative companies learn as they go along, pushing boundaries in response to market changes and demands. What's more of a mystery is how to balance innovation on a fresh platform built from scratch with the legacy tech stack, product suite and customers that continue to serve as the business' foundation. In his General Session at 19th Cloud Expo, Michael Chambliss, Head of Engineering at ReadyTalk, will discuss why and how ReadyTalk diverted from healthy revenue an...
SYS-CON Media announced today that @WebRTCSummit Blog, the largest WebRTC resource in the world, has been launched. @WebRTCSummit Blog offers top articles, news stories, and blog posts from the world's well-known experts and guarantees better exposure for its authors than any other publication. @WebRTCSummit Blog can be bookmarked ▸ Here @WebRTCSummit conference site can be bookmarked ▸ Here
Intelligent machines are here. Robots, self-driving cars, drones, bots and many IoT devices are becoming smarter with Machine Learning. In her session at @ThingsExpo, Sudha Jamthe, CEO of IoTDisruptions.com, will discuss the next wave of business disruption at the junction of IoT and AI, impacting many industries and set to change our lives, work and world as we know it.
In today's uber-connected, consumer-centric, cloud-enabled, insights-driven, multi-device, global world, the focus of solutions has shifted from the product that is sold to the person who is buying the product or service. Enterprises have rebranded their business around the consumers of their products. The buyer is the person and the focus is not on the offering. The person is connected through multiple devices, wearables, at home, on the road, and in multiple locations, sometimes simultaneously...
Established in 1998, Calsoft is a leading software product engineering Services Company specializing in Storage, Networking, Virtualization and Cloud business verticals. Calsoft provides End-to-End Product Development, Quality Assurance Sustenance, Solution Engineering and Professional Services expertise to assist customers in achieving their product development and business goals. The company's deep domain knowledge of Storage, Virtualization, Networking and Cloud verticals helps in delivering ...
SYS-CON Events announced today that Hitrons Solutions will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Hitrons Solutions Inc. is distributor in the North American market for unique products and services of small and medium-size businesses, including cloud services and solutions, SEO marketing platforms, and mobile applications.
The Internet of Things (IoT), in all its myriad manifestations, has great potential. Much of that potential comes from the evolving data management and analytic (DMA) technologies and processes that allow us to gain insight from all of the IoT data that can be generated and gathered. This potential may never be met as those data sets are tied to specific industry verticals and single markets, with no clear way to use IoT data and sensor analytics to fulfill the hype being given the IoT today.
OnProcess Technology has announced it will be a featured speaker at @ThingsExpo, taking place November 1 - 3, 2016, in Santa Clara, California. Dan Gettens, OnProcess’ Chief Analytics Officer, will discuss how Internet of Things (IoT) data can be leveraged to predict product failures, improve uptime and slash costly inventory stock. @ThingsExpo is an annual gathering of IoT and cloud developers, practitioners and thought-leaders who exchange ideas and insights on topics ranging from Big Data in...
SYS-CON Events announced today that Enzu will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Enzu’s mission is to be the leading provider of enterprise cloud solutions worldwide. Enzu enables online businesses to use its IT infrastructure to their competitive advantage. By offering a suite of proven hosting and management services, Enzu wants companies to focus on the core of their online busine...
The explosion of new web/cloud/IoT-based applications and the data they generate are transforming our world right before our eyes. In this rush to adopt these new technologies, organizations are often ignoring fundamental questions concerning who owns the data and failing to ask for permission to conduct invasive surveillance of their customers. Organizations that are not transparent about how their systems gather data telemetry without offering shared data ownership risk product rejection, regu...
The Open Connectivity Foundation (OCF), sponsor of the IoTivity open source project, and AllSeen Alliance, which provides the AllJoyn® open source IoT framework, today announced that the two organizations’ boards have approved a merger under the OCF name and bylaws. This merger will advance interoperability between connected devices from both groups, enabling the full operating potential of IoT and representing a significant step towards a connected ecosystem.
November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Penta Security is a leading vendor for data security solutions, including its encryption solution, D’Amo. By using FPE technology, D’Amo allows for the implementation of encryption technology to sensitive data fields without modification to schema in the database environment. With businesses having their data become increasingly more complicated in their mission-critical applications (such as ERP, CRM, HRM), continued ...
SYS-CON Events announced today that Embotics, the cloud automation company, will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Embotics is the cloud automation company for IT organizations and service providers that need to improve provisioning or enable self-service capabilities. With a relentless focus on delivering a premier user experience and unmatched customer support, Embotics is the fas...
SYS-CON Events announced today that Cloudbric, a leading website security provider, will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Cloudbric is an elite full service website protection solution specifically designed for IT novices, entrepreneurs, and small and medium businesses. First launched in 2015, Cloudbric is based on the enterprise level Web Application Firewall by Penta Security Sys...
Smart Cities are here to stay, but for their promise to be delivered, the data they produce must not be put in new siloes. In his session at @ThingsExpo, Mathias Herberts, Co-founder and CTO of Cityzen Data, will deep dive into best practices that will ensure a successful smart city journey.
SYS-CON Events announced today that MathFreeOn will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. MathFreeOn is Software as a Service (SaaS) used in Engineering and Math education. Write scripts and solve math problems online. MathFreeOn provides online courses for beginners or amateurs who have difficulties in writing scripts. In accordance with various mathematical topics, there are more tha...
Successful digital transformation requires new organizational competencies and capabilities. Research tells us that the biggest impediment to successful transformation is human; consequently, the biggest enabler is a properly skilled and empowered workforce. In the digital age, new individual and collective competencies are required. In his session at 19th Cloud Expo, Bob Newhouse, CEO and founder of Agilitiv, will draw together recent research and lessons learned from emerging and established ...
Data is the fuel that drives the machine learning algorithmic engines and ultimately provides the business value. In his session at Cloud Expo, Ed Featherston, a director and senior enterprise architect at Collaborative Consulting, will discuss the key considerations around quality, volume, timeliness, and pedigree that must be dealt with in order to properly fuel that engine.