Alpine 4.0 Puts an End to Big Data's Compromise With a Single Platform for All Enterprise Data

Alpine Data Labs' New Enterprise Platform Allows Data Teams to Access All Enterprise Data, Work Bi-Directionally With Hadoop, Discover and Share Insights at the Speed of Business

SAN FRANCISCO, CA -- (Marketwired) -- 07/22/14 -- Alpine Data Labs announced today the introduction of Alpine Chorus 4.0, the industry's first Advanced Analytics enterprise platform that enables universal data discovery and search, bi-directional integration between Hadoop and all major data platforms, as well as compatibility with Spark and Cloudera 5.

Alpine Chorus 4.0 brings innovation in data discovery, query parallelization and machine learning in distributed environments. The company also introduces a first-of-its-kind life cycle management facility for Hadoop and non-Hadoop platforms, which allows sophisticated machine learning algorithms to be run and managed simply across heterogeneous data systems such as Cloudera, MapR and Pivotal HD, or databases like PostgreSQL, Oracle and Greenplum. Complimentary access to Alpine Chorus 4.0 can be found at

The Network Effect of Insights
"Research shows that only 4% of enterprises get business value out of their Big Data investment," says Joe Otto, President and CEO at Alpine Data Labs. "The current industry solutions encourage a siloed and non-scalable approach to Big Data and that simply limits progress. We focus on building the most comprehensive and scalable platform that enterprises can use to achieve Big Data ROI and to better connect people, data and insights. From helping people quickly visualize and work with any data, to running models 100 times faster on Spark, to operationalizing the deployment of real-time models via standards like PMML, customers using Alpine Chorus innovate faster because they can easily run deep algorithms at Big Data scale and in a timeframe of business relevance."

The new solution boasts over 100 new features and furthers the company's advantage in the field of Advanced Analytics. With Alpine Chorus 4.0, data scientists and engineers can be productive on any data -- Hadoop or not; business users are engaged early and quickly add value to the advanced analytics conversation; and finally, executives rely on a standard platform to build repeatable, secure and reusable analytical practices.

Over the last 6 months alone, the company has tripled its customer base and has grown by over 200% in the financial services, online media, government, retail and manufacturing sectors.

Data Discovery Made Simple
Most organizations cut into their competitive advantage early in the analytical process because their data scientists can't easily discover, assemble and transform data before working with it. That process can take months, because moving data is not simple and when it comes to working with Hadoop data, new skill sets need to be acquired.

Alpine Chorus 4.0's universal data discovery capability allows users to search, find and use data regardless of where it is. Using Alpine Chorus' "Google-like" search, users can find and browse any file, model, workflow, comment, dataset, etc. -- and when data is found, they can visualize it through powerful heat maps, scatter plots and histograms, all without data movement.

"This functionality alone made our team more effective. It allowed us to assemble and understand data quickly, without the complexity of working with MapReduce, or Pig or SQL," says Ron Rasmussen, CTO & SVP Engineering at Xactly Corp. "Our ability to work rapidly and iterate at Big Data scale is core to helping us deliver the best products to our customers."

Big Data Analytics at the Speed of Business
"Removing Hadoop's complexity will give any company a head start, but it's not enough," says Steven Hillion, co-founder and Chief Product Officer at Alpine Data Labs. "Once enterprises have identified the data they want to work with, they need to interrogate it without being encumbered by performance issues."

In this new release, the company unveils its Parallel Analytics Engine, a virtual layer that now executes all of Alpine Chorus' algorithms with multiple levels of parallelism. The engine includes the Workflow Graph Optimizer, which parses analytics workloads and deploys them in parallel to maximize the use of available resources, and the Polymorphic Data Service, which decides at run-time how to optimize queries for each type of data platform. These innovations, unique to Alpine Data Labs, represent the most efficient way to run sophisticated machine learning algorithms on a variety of distributed systems. They also made it possible for Alpine Chorus 4.0 to be the first Advanced Analytics platform to be certified on CDH5 and Spark, and to be benchmarked running complex algorithms up to a hundred times faster than previously possible.
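
As a rough illustration of what deploying a workflow "in parallel" can mean, the sketch below groups the steps of a dependency graph into stages whose members do not depend on one another and could therefore run concurrently. It is not Alpine's Workflow Graph Optimizer, whose internals are not described here; the function name `parallel_stages` and the example step names are hypothetical, chosen only for illustration.

```python
def parallel_stages(dependencies):
    """Group workflow steps into stages of mutually independent steps.

    `dependencies` maps each step name to the set of steps that must
    finish before it can start. Steps in the same returned stage have
    no unmet dependencies on each other, so they could run at once.
    (Illustrative sketch only; not Alpine Chorus' actual optimizer.)
    """
    # Work on a copy so the caller's mapping is left untouched.
    remaining = {step: set(deps) for step, deps in dependencies.items()}
    # Steps that appear only as prerequisites still need an entry.
    for deps in dependencies.values():
        for dep in deps:
            remaining.setdefault(dep, set())

    stages = []
    while remaining:
        # Every step whose prerequisites are all done is ready now.
        ready = sorted(s for s, deps in remaining.items() if not deps)
        if not ready:
            raise ValueError("workflow contains a cycle")
        stages.append(ready)
        for step in ready:
            del remaining[step]
        # Mark the finished steps as satisfied for everything left.
        for deps in remaining.values():
            deps.difference_update(ready)
    return stages


# Hypothetical analytics workflow: two loads, a join, then two branches.
workflow = {
    "load_sales": set(),
    "load_customers": set(),
    "join": {"load_sales", "load_customers"},
    "normalize": {"join"},
    "summary_stats": {"join"},
    "train_model": {"normalize"},
}

for number, stage in enumerate(parallel_stages(workflow), start=1):
    print(number, stage)
```

In this example the two load steps form the first stage, and `normalize` and `summary_stats` share a later stage because both depend only on `join`. A real engine would also weigh available resources and data locality, which this sketch ignores.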

"With Alpine Chorus 4.0 customers can work on important analytical issues at Big Data speed and keep the business engaged because of the solution's visual, powerful and collaborative approach," says Amr Awadallah, Founder and CTO at Cloudera. "Alpine Chorus is a showcase for analytics innovation in the Big Data Era and we're excited that it features the power of Cloudera 5."

The Internet of People
"The key to analytical excellence is collaboration," says Dan Vesset, Vice President of IDC's Business Analytics research. "Collaboration often gets a bad name because it sounds too abstract. However, our research shows that effective cross-enterprise collaboration has a determinant role in helping Big Data projects succeed and return value. Alpine Data Labs is leading the way here."

The new features in Alpine Chorus 4.0 make the benefits of collaboration very tangible:

  • Data scientists can tap into the innovation of their business counterparts at every point in the analytics process through user-generated data: comments, tags, links and documents applied to models, workflows, datasets and sandboxes.
  • Business Analysts can easily and visually understand data science work through collaborative analytics workspaces, communicating and iterating in real-time, increasing the value and confidence of their analysis.
  • Data and IT engineers rely on GitHub-like version control features, job scheduling and data management capabilities, and can operationalize Big Data Analytics in a secure and consistent manner.
  • Executives benefit from a platform that is innovative, open and secure because all interactions in Alpine Chorus are recorded and auditable.

Alpine Chorus 4.0 rests on key new technological breakthroughs:

1) Visualize Before You Analyze: Universal Search, Interactive Visualizations and Data Augmentation add a layer of understanding on top of any data.
2) Transform and Query Without Extraction: Alpine Chorus' comprehensive library of transformation operators -- from simple filters, to variable and null-value replacement operators, to pivot, multi-join and normalization functions -- is accessible via SQL editor or visual drag-and-drop icons. All of Alpine Chorus' operators run in place and in parallel.
3) Manage Data In and Out of Hadoop: Data can be sent to Hadoop for building Big Data Lakes, and out of Hadoop to write the results of large-scale computation done on Big Data to operational systems.
4) Do Predictive Analytics Natively on Big Data: All of Alpine Chorus' algorithms are written and optimized to execute in parallel, making analysis at Big Data speed a reality.
5) Work With the Latest Innovations: Embraces data science standards for real-time scoring (PMML), and supports and contributes to open source platform technologies (Spark, Sqoop, MADlib, MLlib, etc.). First Advanced Analytics platform to be certified on Spark and Cloudera CDH 5.
6) Extend and Productionize Models: The Alpine Chorus REST API is available to run and edit user-defined functions (UDFs) as part of an end-to-end analytic workflow.
7) Manage the Full Analytics Life Cycle: GitHub-like version control (copy workflow, history capture, revert capability), check-in, commenting, model review and tracking, job scheduling and data management.

For more, try Alpine for free @

Alpine Chorus is the world's first Enterprise Platform for Advanced Analytics on Big Data and Hadoop. With Alpine, data scientists and business analysts can work with large data sets, develop and collaborate on models at scale without having to use code or download software. Leaders in all industries, from Financial Services to Healthcare, use Alpine to outsmart their competition. Maybe you should too. Find out more at:

Copyright © 2009 Marketwired. All rights reserved. All the news releases provided by Marketwired are copyrighted. Any forms of copying other than an individual user's personal reference without express written permission is prohibited. Further distribution of these materials is strictly forbidden, including but not limited to, posting, emailing, faxing, archiving in a public database, redistributing via a computer network or in a printed form.
