Click here to close now.

Welcome!

Eclipse Authors: XebiaLabs Blog, Ken Fogel, Sematext Blog, Marcin Warpechowski, Trevor Parsons

News Feed Item

Elasticsearch Now Certified on Cloudera Enterprise 5; Releases New Hadoop Connector

Elasticsearch Unlocks Potential for Businesses to Get Immediate Insights out of Data They Store in Hadoop

LOS ALTOS, CA and AMSTERDAM, THE NETHERLANDS -- (Marketwired) -- 06/19/14 -- Elasticsearch, Inc., the company on a mission to make data useful to businesses by delivering the world's most advanced search and analytics engine, today announced the 2.0 release of its Hadoop connector, Elasticsearch for Apache Hadoop, along with certification on Cloudera Enterprise 5. With Cloudera certification, Elasticsearch is now compatible across all Apache-based Hadoop distributions, including HortonWorks and MapR, helping businesses extract immediate insights regardless of where their hundreds of terabytes or even petabytes of data are stored.

Elasticsearch is the search and analytics engine behind the ELK stack, which also utilizes Logstash, a log management tool, and Kibana's powerful data visualization capabilities to help businesses pull vital information from their data stores. When used in conjunction with Hadoop, organizations no longer need to run a batch process and wait hours to analyze their data -- Elasticsearch for Apache Hadoop can pipe data to Elasticsearch for indexing as it's being generated, making it available for search and analysis in a matter of seconds. Kibana can also be used to explore massive amounts of data in Elasticsearch through easy-to-generate pie charts, bar graphs, scatter plots, histograms, and more.

How Businesses Leverage Elasticsearch and Hadoop
Elasticsearch is becoming the critical piece of pulling data from any environment and getting it into the hands of developers, engineering leads, CTOs, and CIOs who need insight into moving parts of their business at the rate they are happening. Customer examples include:

  • Klout, which stores petabytes of its 400 million+ users' data in a Hadoop Distributed File System and connects it to Elasticsearch. Klout query results, used to build targeted marketing campaigns, are delivered in seconds rather than minutes.
  • MutualMind, which enables customers like AT&T, Kraft, Nestle, and Starbucks to monitor their brands on social networks. After its Hadoop batches started taking 15+ minutes, MutualMind moved to Elasticsearch to power its real-time analytics, while utilizing Hadoop for statistical analysis.
  • An international financial services firm that uses Elasticsearch to analyze its access logs in just minutes instead of having to wait hours to run MapReduce jobs. Because Elasticsearch provided insights so quickly on the firm's large amounts of data, they've been able to increase the window of data they can analyze from one hour to a full week.

Key Features of Elasticsearch for Apache Hadoop

  • The ability to read and write data between Hadoop and Elasticsearch: Lets businesses get immediate, actionable insights by writing their data to Elasticsearch for real-time search and analysis. Complex jobs that would normally take minutes or hours to run in Hadoop can be handled quickly in Elasticsearch and read right back to Hadoop.
  • Native integration and support for popular Hadoop libraries: Lets users run queries natively on Hadoop through MapReduce, Hive, Pig, or Cascading APIs.
  • Snapshot/Restore: Makes it easy to take a snapshot of data within Elasticsearch -- perhaps a year's worth -- and archive it in Hadoop. At any time, the snapshot can be restored back to Elasticsearch for additional analysis.

Supporting Quotes from Cloudera, Elasticsearch, and Klout

Steven Schuurman, co-founder and CEO, Elasticsearch
"Hadoop was created to store and archive data at a massive scale, but businesses need to be able to ask, iterate, and extract actionable insights from this data -- which is what we designed our products for. With today's certification from Cloudera, Elasticsearch now works with all Apache-based Hadoop distributions, and with it, solves the last mile of big data Hadoop deployments by getting big insights, fast."

Tim Stevens, vice president of Business and Corporate Development at Cloudera
"Part of our mission at Cloudera is to support and promote an open architecture and allow customers to leverage their technology investments. Together, Cloudera and Elasticsearch provide businesses with a solution that allows them to get insight out of massive amounts of data."

Felipe Oliveria, director of Engineering, Backend for Klout
"Elasticsearch has a very good integration with Hadoop. It allows us to export a Hive table to an index on Elasticsearch very easily. HBase is a great data store, and it allows random access to the data, which Elasticsearch is perfect for. Elasticsearch fits very nicely into our data pipeline."

Because Elasticsearch works across distributed, diverse environments, engineers can search, extract, clean up and analyze data whether it comes from log events, social media activity, support tickets, website analytics or product interactions. Thousands of businesses worldwide continue to adopt Elasticsearch to store, search and analyze any type of data in real time, including Bloomberg, Comcast, eBay, Facebook, GitHub, Mayo Clinic, McGraw-Hill, Netflix, The New York Times, Target, Verizon, WordPress and Yelp.

To download Elasticsearch for Apache Hadoop, visit http://www.elasticsearch.org/overview/hadoop/. To find out more about Elasticsearch, visit www.elasticsearch.com.

Upcoming webinar: Real-time Analytics and Anomaly Detection using Elasticsearch and Apache Hadoop
On Wednesday, August 20, 2014 at 9:00am PT/12:00pm ET, Elasticsearch will host a webinar that goes over the features and benefits of Elasticsearch for Apache Hadoop, including a demonstration of how to use it as a platform to perform search and analytics, such as anomaly detection. To register, visit http://www.elasticsearch.org/webinars/elasticsearch-and-apache-hadoop.

About Elasticsearch, Inc.
Elasticsearch is on a mission to make massive amounts of data usable for businesses everywhere by delivering the world's most advanced search and analytics engine. With a laser focus on achieving the best user experience imaginable, the Elasticsearch ELK stack -- comprised of Elasticsearch, Logstash and Kibana -- has become one of the most popular and rapidly growing open source solutions in the market. Used by thousands of enterprises in virtually every industry today, Elasticsearch, Inc. provides production support, development support and training for the full ELK stack.

Elasticsearch, Inc. was founded in 2012 by the people behind the Elasticsearch and Apache Lucene open source projects. Since its initial release, Elasticsearch has more than 9 million cumulative downloads. Elasticsearch, Inc. is backed by Benchmark Capital, Index Ventures and NEA, with headquarters in Amsterdam and Los Altos, California, and offices around the world.

More Stories By Marketwired .

Copyright © 2009 Marketwired. All rights reserved. All the news releases provided by Marketwired are copyrighted. Any forms of copying other than an individual user's personal reference without express written permission is prohibited. Further distribution of these materials is strictly forbidden, including but not limited to, posting, emailing, faxing, archiving in a public database, redistributing via a computer network or in a printed form.

@ThingsExpo Stories
All major researchers estimate there will be tens of billions devices - computers, smartphones, tablets, and sensors - connected to the Internet by 2020. This number will continue to grow at a rapid pace for the next several decades. With major technology companies and startups seriously embracing IoT strategies, now is the perfect time to attend @ThingsExpo, June 9-11, 2015, at the Javits Center in New York City. Learn what is going on, contribute to the discussions, and ensure that your enterprise is as "IoT-Ready" as it can be
Container frameworks, such as Docker, provide a variety of benefits, including density of deployment across infrastructure, convenience for application developers to push updates with low operational hand-holding, and a fairly well-defined deployment workflow that can be orchestrated. Container frameworks also enable a DevOps approach to application development by cleanly separating concerns between operations and development teams. But running multi-container, multi-server apps with containers is very hard. You have to learn five new and different technologies and best practices (libswarm, sy...
SYS-CON Events announced today that DragonGlass, an enterprise search platform, will exhibit at SYS-CON's 16th International Cloud Expo®, which will take place on June 9-11, 2015, at the Javits Center in New York City, NY. After eleven years of designing and building custom applications, OpenCrowd has launched DragonGlass, a cloud-based platform that enables the development of search-based applications. These are a new breed of applications that utilize a search index as their backbone for data retrieval. They can easily adapt to new data sets and provide access to both structured and unstruc...
As the Internet of Things unfolds, mobile and wearable devices are blurring the line between physical and digital, integrating ever more closely with our interests, our routines, our daily lives. Contextual computing and smart, sensor-equipped spaces bring the potential to walk through a world that recognizes us and responds accordingly. We become continuous transmitters and receivers of data. In his session at @ThingsExpo, Andrew Bolwell, Director of Innovation for HP's Printing and Personal Systems Group, discussed how key attributes of mobile technology – touch input, sensors, social, and ...
WebRTC defines no default signaling protocol, causing fragmentation between WebRTC silos. SIP and XMPP provide possibilities, but come with considerable complexity and are not designed for use in a web environment. In his session at @ThingsExpo, Matthew Hodgson, technical co-founder of the Matrix.org, discussed how Matrix is a new non-profit Open Source Project that defines both a new HTTP-based standard for VoIP & IM signaling and provides reference implementations.
SYS-CON Events announced today that the "First Containers & Microservices Conference" will take place June 9-11, 2015, at the Javits Center in New York City. The “Second Containers & Microservices Conference” will take place November 3-5, 2015, at Santa Clara Convention Center, Santa Clara, CA. Containers and microservices have become topics of intense interest throughout the cloud developer and enterprise IT communities.
Buzzword alert: Microservices and IoT at a DevOps conference? What could possibly go wrong? In this Power Panel at DevOps Summit, moderated by Jason Bloomberg, the leading expert on architecting agility for the enterprise and president of Intellyx, panelists will peel away the buzz and discuss the important architectural principles behind implementing IoT solutions for the enterprise. As remote IoT devices and sensors become increasingly intelligent, they become part of our distributed cloud environment, and we must architect and code accordingly. At the very least, you'll have no problem fil...
Almost everyone sees the potential of Internet of Things but how can businesses truly unlock that potential. The key will be in the ability to discover business insight in the midst of an ocean of Big Data generated from billions of embedded devices via Systems of Discover. Businesses will also need to ensure that they can sustain that insight by leveraging the cloud for global reach, scale and elasticity.
The 4th International Internet of @ThingsExpo, co-located with the 17th International Cloud Expo - to be held November 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA - announces that its Call for Papers is open. The Internet of Things (IoT) is the biggest idea since the creation of the Worldwide Web more than 20 years ago.
"People are a lot more knowledgeable about APIs now. There are two types of people who work with APIs - IT people who want to use APIs for something internal and the product managers who want to do something outside APIs for people to connect to them," explained Roberto Medrano, Executive Vice President at SOA Software, in this SYS-CON.tv interview at Cloud Expo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
The 17th International Cloud Expo has announced that its Call for Papers is open. 17th International Cloud Expo, to be held November 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA, brings together Cloud Computing, APM, APIs, Microservices, Security, Big Data, Internet of Things, DevOps and WebRTC to one location. With cloud computing driving a higher percentage of enterprise IT budgets every year, it becomes increasingly important to plant your flag in this fast-expanding business opportunity. Submit your speaking proposal today!
In their session at @ThingsExpo, Shyam Varan Nath, Principal Architect at GE, and Ibrahim Gokcen, who leads GE's advanced IoT analytics, focused on the Internet of Things / Industrial Internet and how to make it operational for business end-users. Learn about the challenges posed by machine and sensor data and how to marry it with enterprise data. They also discussed the tips and tricks to provide the Industrial Internet as an end-user consumable service using Big Data Analytics and Industrial Cloud.
17th Cloud Expo, taking place Nov 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Meanwhile, 94% of enterprises are using some form of XaaS – software, platform, and infrastructure as a service.
Sensor-enabled things are becoming more commonplace, precursors to a larger and more complex framework that most consider the ultimate promise of the IoT: things connecting, interacting, sharing, storing, and over time perhaps learning and predicting based on habits, behaviors, location, preferences, purchases and more. In his session at @ThingsExpo, Tom Wesselman, Director of Communications Ecosystem Architecture at Plantronics, will examine the still nascent IoT as it is coalescing, including what it is today, what it might ultimately be, the role of wearable tech, and technology gaps stil...
The explosion of connected devices / sensors is creating an ever-expanding set of new and valuable data. In parallel the emerging capability of Big Data technologies to store, access, analyze, and react to this data is producing changes in business models under the umbrella of the Internet of Things (IoT). In particular within the Insurance industry, IoT appears positioned to enable deep changes by altering relationships between insurers, distributors, and the insured. In his session at @ThingsExpo, Michael Sick, a Senior Manager and Big Data Architect within Ernst and Young's Financial Servi...
The Workspace-as-a-Service (WaaS) market will grow to $6.4B by 2018. In his session at 16th Cloud Expo, Seth Bostock, CEO of IndependenceIT, will begin by walking the audience through the evolution of Workspace as-a-Service, where it is now vs. where it going. To look beyond the desktop we must understand exactly what WaaS is, who the users are, and where it is going in the future. IT departments, ISVs and service providers must look to workflow and automation capabilities to adapt to growing demand and the rapidly changing workspace model.
Since 2008 and for the first time in history, more than half of humans live in urban areas, urging cities to become “smart.” Today, cities can leverage the wide availability of smartphones combined with new technologies such as Beacons or NFC to connect their urban furniture and environment to create citizen-first services that improve transportation, way-finding and information delivery. In her session at @ThingsExpo, Laetitia Gazel-Anthoine, CEO of Connecthings, will focus on successful use cases.
One of the biggest impacts of the Internet of Things is and will continue to be on data; specifically data volume, management and usage. Companies are scrambling to adapt to this new and unpredictable data reality with legacy infrastructure that cannot handle the speed and volume of data. In his session at @ThingsExpo, Don DeLoach, CEO and president of Infobright, will discuss how companies need to rethink their data infrastructure to participate in the IoT, including: Data storage: Understanding the kinds of data: structured, unstructured, big/small? Analytics: What kinds and how responsiv...
Building low-cost wearable devices can enhance the quality of our lives. In his session at Internet of @ThingsExpo, Sai Yamanoor, Embedded Software Engineer at Altschool, provided an example of putting together a small keychain within a $50 budget that educates the user about the air quality in their surroundings. He also provided examples such as building a wearable device that provides transit or recreational information. He then reviewed the resources available to build wearable devices at home including open source hardware, the raw materials required and the options available to power s...
With major technology companies and startups seriously embracing IoT strategies, now is the perfect time to attend @ThingsExpo in Silicon Valley. Learn what is going on, contribute to the discussions, and ensure that your enterprise is as "IoT-Ready" as it can be! Internet of @ThingsExpo, taking place Nov 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 17th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal an...