Click here to close now.


Eclipse Authors: Liz McMillan, XebiaLabs Blog, Ken Fogel, Sematext Blog, Marcin Warpechowski

Blog Feed Post

The CTOvision Big Data Top Enterprise Tech List: The most important data technologies to accelerate into your infrastructure


We produced this list as an aid to the enterprise CTO seeking information on the most capable mission-enabling infrastructure technologies. This is a companion piece to our list of the top analytical technologists on our Analyst One website. Our methodologies are at the bottom of this list.

We trust you will find this list interesting and informative. Have some new technologies to suggest for the list? Let us know at our Contact Page.


The CTOvision Big Data Top Enterprise Technologies List

Aerospike: Aerospike delivers the first flash-optimized in-memory database and the most reliable NoSQL database for revenue critical, real-time big data applications. The database of choice in advertising, Aerospike is the user store and system of engagement for Internet-scale, interaction platforms, such as AppNexus, Bluekai, eXelate, The Trade Desk and [x+1], predictably processing terabytes of data and billions of transactions per day, with 10x better performance, 10x fewer servers and zero downtime. Developers in mobile, video, gaming, social, ecommerce, retail and more can create the most compelling interactions extending Aerospike to fit their applications. Aerospike is headquartered in Silicon Valley; investors include Alsop Louie, Draper Associates and NEA.
Appfluent: Appfluent provides IT organizations with visibility into usage and performance of data warehouse and business intelligence systems. IT decision makers can view exactly which enterprise data is being used or not used, determine how business intelligence is performing and identify causes of database performance issues. With Appfluent, customers can address exploding data growth and start the smart move to Hadoop and Big Data.

Arista Networks
Arista Networks: Arista Networks was founded to deliver networking solutions for large data center and HPC environments and delivers a portfolio of Gigabit and 10GBE switches that redefine network architectures, brings extensibility to networking and dramatically changes the price/performance of data center networks. At the core of Arista’s platform is the Extensible Operating System (EOS™), a ground-breaking network operating system with single-image consistency across hardware platforms, and modern core architecture enabling in-service upgrades and application extensibility.

Azul Systems
Azul Systems: Azul Zing™ is essential technology for Big Data applications that are critical to business results. Zing is the only Java performance solution that delivers both very low latency and high sustained throughput for real-time analytics and self-service business intelligence. With Zing your Big Data applications can utilize massive in-memory datasets while delivering predictable performance, allowing reports to be run on more live data with faster results. Zing even reduces or eliminates the need for extra caching applications.

Basho Technologies
Basho Technologies: Basho Technologies is the creator and developer of Riak, an open-source distributed database, providing extreme high-availability, fault-tolerance, and operational simplicity even at scale. Riak has rapidly gained adoption throughout the Fortune 100 and has become foundational to many of the world’s fastest-growing Web-based, mobile and social applications.

Cloudera: Cloudera pioneered the business case for Hadoop with CDH, the world’s most comprehensive, tested and widely deployed distribution of Hadoop. Its Platform for Big Data, Cloudera Enterprise, empowers enterprises to Ask Bigger Questions™ and gain rich, actionable insights from all their data to derive real business value and competitive advantage. As the top contributor to the Apache open source community and leading educator of data professionals, with tens of thousands of nodes under management and hundreds of customers across diverse markets, Cloudera is the category leader that sets the standard for Hadoop in the enterprise.

Couchbase: Couchbase is a leading provider of NoSQL database technology and the company behind the Couchbase open source project. Couchbase Server, the company’s flagship product, is a NoSQL document-oriented database with production deployments at AOL, Cisco, Concur, LinkedIn, Orbitz,, Shuffle Master, Zynga and hundreds of other household names worldwide. It is particularly well suited for interactive applications, providing easy scalability, consistent high performance, 24×365 availability, and a flexible data model for ease of development.

Data Direct Networks
Data Direct Networks: DDN is the world’s largest, privately-held, data storage infrastructure provider. With a unique and exacting focus on the requirements of today’s massive unstructured data generators, DDN has innovated a comprehensive product portfolio for Big Data applications which are optimized for the world’s most data-intensive environments.  hScaler, the world’s first truly unified analytics appliance factory configured and solutions ready has the ability to be deployed in hours and answer queries in seconds.  hScaler can put you in charge of your big data while truly lowering your TCO.

Dataguise, Inc.
Dataguise, Inc.: Dataguise provides data privacy protection and risk assessment analytics allowing organizations to safely leverage and share enterprise data. Their solutions simplify governance by automatically protecting the data (masking or encryption) and providing actionable compliance intelligence. These capabilities simplify risk management and reduce regulatory compliance costs.

GridGain: GridGain develops scalable, distributed, in-memory data platform technology for real time data processing. The company’s Java-based middleware products enable development of applications and services that can instantly access terabytes to petabytes of information from any data source or file system, distribute computational tasks across any number of machines, and produce results orders of magnitude faster than traditionally architected systems. GridGain’s customers include innovative web and mobile businesses, leading Fortune 500 companies, and top government agencies. The company is headquartered in Foster City, California.

Hadapt: Hadapt has developed the industry’s only Big Data analytic platform natively integrating SQL with Apache Hadoop. The unification of these traditionally segregated platforms enables customers to analyze all of their data (structured, semi-structured and unstructured) in a single platform-no connectors, complexities or rigid structure. The company’s core technology began as research in the Yale Computer Science department under co-founders Dr. Daniel Abadi and Ph.D student Kamil Bajda-Pawlikowski. In 2011, led by co-founder and CEO Justin Borgman, Hadapt raised $9.5MM Series A round of funding from Bessemer Venture Partners and Norwest Venture Partners. The company is headquartered in Cambridge, MA.

Jaspersoft: Jaspersoft empowers millions of people every day to make faster decisions by bringing them timely, actionable data inside their apps and business processes. Its embeddable, cost-effective reporting and analytics platform allows anyone to quickly self-serve and get the answers they need and scales architecturally and economically to reach everyone.

Kognitio: Kognitio is an in-memory analytical platform that can be tightly integrated with Hadoop for high-performance advanced analytics that make Big Data more consumable for enterprises, especially those with mature BI environments or engrained tools. An MPP platform itself, it enables ad-hoc queries in real-time, wrapped in industry-standard SQL for easy dissemination without MapReduce. Parallelizing standard binary languages like R and Python to run statistical and algorithmic functions in-memory, it is used by Data Scientists, BI professionals and Systems/Database Administrators to give fast access to data that persists in Hadoop and other data storage layers, enabling a Logical Data Warehouse model.

LucidWorks: LucidWorks, the trusted name in Search, Discovery and Analytics, transforms the way people access information to enable data-driven decisions. Leveraging both structured and unstructured data built on the power of Apache Lucene/Solr open source search, LucidWorks delivers unmatched stability, scalability, and time-to-delivery for search applications. LucidWorks Search provides ease of use development to access up to billions of documents with sub-second query and faceting response time. LucidWorks Big Data tightly integrates key Apache projects needed to build and deploy applications providing ubiquitous access to the data trapped inside Hadoop.

Mellanox Technologies
Mellanox Technologies: Mellanox Technologies (NASDAQ: MLNX, TASE: MLNX) is a leading supplier of end-to-end InfiniBand and Ethernet interconnect solutions and services for servers and storage. Mellanox interconnect solutions increase data center efficiency by providing the highest throughput and lowest latency, delivering data faster to applications and unlocking system performance capability. Mellanox offers a choice of fast interconnect products: adapters, switches, software and silicon that accelerate application runtime and maximize business results for a wide range of markets including high performance computing, enterprise data centers, Web 2.0, cloud, storage and financial services. More information is available at Founded in 1999, Mellanox Technologies is headquartered in Sunnyvale, California and Yokneam, Israel.

MemSQL: MemSQL is a distributed database for real-time analytics. Data scientists, analysts, and developers can query high velocity workloads and historical data simultaneously, all through a convenient SQL interface. By combining significant speed and throughput advantages with complex analytics, an enterprise can gain instant insight to their business and stay competitive in a fast-moving environment.

MetaScale: An early adopter of big data and legacy modernization initiatives, MetaScale provides cutting-edge technologies, Hadoop training and technology solutions to its customers. As a subsidiary of Sears Holdings Corporation, we understand the value of heritage and the need for constant innovation to drive growth. Through this heritage, we offer a deep understanding of employing complex big data tools to solve traditional business problems in the enterprise. Our team brings extensive experience in the migration of workloads off mainframe, large-scale private open-source cloud computing, Hadoop for big data BI and legacy infrastructure modernization.

MongoDB: MongoDB (from humongous) is reinventing data management and powering big data as the leading NoSQL database. Designed for how we build and run applications today, it empowers organizations to be more agile and scalable. MongoDB enables new types of applications, better customer experience, faster time to market and lower costs. It has a thriving global community with over 4 million downloads, 100,000 online education registrations, 20,000 user group members and 20,000 MongoDB Days attendees. The company has more than 600 customers, including many of the world’s largest organizations.

Optensity: provides AppSymphony. AppSymphony is a platform that enables businesses and government organizations to exploit big data sources while leveraging scalable computing environments and their current workforce.  AppSymphony’s execution engine runs across a variety of compute environments including Amazon EC2, Rackspace, and Google Compute Engine.  Once an analytic workflow, or “App”, has been authored and validated, it is discoverable and useable by anyone else in the enterprise, maximizing the App’s utility to the entire organization.
Pentaho: Pentaho is building the future of business analytics. Pentaho’s open source heritage drives our continued innovation in a modern, integrated, embeddable platform built for accessing all data sources. With support for all of the leading Hadoop distributions, NoSQL databases and high performance analytic databases, Pentaho provides the broadest support for big data analytics, as well as integration and orchestration of big data and traditional sources.

Platfora: Platfora’s mission is to empower customers to transform their businesses into fact-based enterprises. Platfora masks the complexity of Hadoop, making it easy for customers to understand all the facts in their business across events, actions, behaviors and time. For more details, visit or follow @platfora and #FactBased on twitter.

Progress DataDirect
Progress DataDirect: Progress DataDirect is the world leader in data connectivity, offering the most comprehensive software solutions for connecting the world’s most critical applications to data and services, running on any platform, using proven and emerging standards. Progress Software’s DataDirect Cloud product helps you address the challenges associated with cloud data connectivity by providing a managed service offering that delivers standards based SQL connectivity to a broad spectrum of SaaS, Big Data, Social, and NoSQL data sources. With a proven, 20-year history, strong technical leadership and robust product line, software architects worldwide depend on Progress Software’s DataDirect line of products to connect their applications to an unparalleled range of data sources using standard-based interfaces such as ODBC, JDBC, ADO.NET, XQuery and SOAP.

Protegrity: Protegrity, the innovative leader of groundbreaking enterprise data security software, provides high performance, infinitely scalable end-to-end data security solutions for organizations worldwide. Protegrity helps its customers secure all of their sensitive data in Hadoop and across the enterprise, ensuring compliance with all PCI, PHI and Privacy regulations. Protegrity’s solutions give corporations the ability to implement a variety of data protection methods, including vaultless tokenization, strong encryption, masking and monitoring to ensure the protection of their sensitive data.

Rogue Wave Software
Rogue Wave Software: Rogue Wave Software is the largest independent provider of cross-platform software development tools and embedded components for the next generation of HPC applications. Offering a broad portfolio, Rogue Wave enables developers to increase productivity and harness the power of multicore computing while reducing the complexity of developing multi-processor and data-intensive applications. With Rogue Wave’s IMSL Numerical Libraries, businesses and organizations reduce development time, realize a lower total cost of ownership, and improve quality and maintainability. The robust and portable collection of embeddable math and statistical functions available in native C, C++, C#, Fortran, and Java™ provide sophisticated analytics for high-performance, mission-critical applications.

SGI: SGI, the trusted leader in technical computing, helps customers solve their most demanding business and technology challenges by delivering high performance computing (HPC), Big Data, and data storage solutions that accelerate time to discovery, innovation, and profitability. Delivering extreme speed, scale, and efficiency, SGI server and storage offerings are utilized by scientific, business, and government communities to solve challenging, data-intensive computing and data management problems, typically requiring large amounts of computing power and fast and efficient data movement both within the computing system and to and from large-scale data storage installations.
SiSense: SiSense Prism is a Big Data Analytics Solution that provides the benefits of In-Memory without its disadvantages. SiSense In-Memory Columnar Datastore analyzes 100 times more data at 10 times the speed of comparable solutions. No need to set up complex data warehouse systems or OLAP cubes. No need for programming either, regardless where data comes from or how big it is.

Skytree Inc.
Skytree Inc.: Skytree’s Machine Learning platform gives organizations the power to discover deep analytic insights, predict future trends, make recommendations and reveal untapped markets and customers. Predictive Analytics and Machine Learning are quickly becoming must-have technologies in the age of Big Data, and Skytree provides the Enterprise-grade foundation. Skytree’s flagship product – Skytree Server – is the only general purpose scalable Machine Learning system on the market, built for the highest accuracy at unprecedented speed and scale.

SoftwareAG: provides big data tools and infrastructure including Enterprise Ehcache. Enterprise Ehcache. Enterprise Ehcache snaps into enterprise applications for a faster, easier, more broadly applicable approach to achieving high-performance scalability. Based on the de facto caching standard for enterprise Java, Enterprise Ehcache is an easy-to-deploy solution for hard-to-solve problems. With just a few config changes, you can: Achieve 10-times improvement in application response times, Gain headroom for terabytes of data growth, Offload slow, expensive databases or mainframes, Save on licensing, administration and hardware costs.
Splunk: Splunk Inc. (NASDAQ: SPLK) provides the engine for machine data. Splunk software collects, indexes and harnesses the machine-generated big data coming from the websites, applications, servers, networks and mobile devices that power business. Splunk software enables organizations to monitor, search, analyze, visualize and act on massive streams of real-time and historical machine data. More than 4,800 enterprises, universities, government agencies and service providers in over 80 countries use Splunk Enterprise to gain Operational Intelligence that deepens business and customer understanding, improves service and uptime, reduces cost and mitigates cyber-security risk. Splunk Storm, a cloud-based subscription service, is used by organizations developing applications in the cloud.

Sqrrl: Sqrrl is a Big Data software company whose employees have dealt with the world’s largest, most complex, and most sensitive datasets for the last decade. Sqrrl’s software product, Sqrrl Enterprise, is the most secure and scalable Big Data platform for building real-time analytical applications and is powered by Apache Accumulo™ and Hadoop. Sqrrl Enterprise extends the capabilities of Accumulo with additional data ingest, security, and real-time analytical features that help unlock the power of Big Data.

Zettaset: Zettaset, the leader in secure Big Data management, automates, accelerates, and simplifies Hadoop deployment for the enterprise. Zettaset Orchestrator&tade; is the only Big Data management solution designed to address enterprise requirements for security, high availability, manageability and scalability in a distributed computing environment. Orchestrator helps organizations move Hadoop from pilot into production, replacing open source management with a more robust approach that easily fits into existing enterprise security and policy frameworks. Zettaset Orchestrator provides comprehensive fail-over for all critical cluster services, facilitates integration with the most widely adopted ETL and analytics applications, and is compatible with the leading Hadoop distributions.


Our Methodologies 

We firmly believe that technologies must be supported by strong companies, so we focus on companies with proven ability to serve in real enterprises. In most cases we select VC backed firms because those come with staying power. We love open source, but open source solutions should also be supported by a strong firm. We also believe it is important to only report on firms that have products that are really available now (no vaporware).  Additionally, we believe most firms that have a capability that can make a difference for the modern analyst will be interested in demonstrating that capability at Hadoop World. This last assumption allowed us to get a jumpstart on our first list. We started our process by reviewing the full list of sponsors and exhibitors at the coming Hadoop World (for a full list of all exhibitors see here). We then reviewed previous research at our and sites to round out this initial list.

We know our methodology has some holes. But as good analysts we are going to keep our eyes and ears open for other technologies we can report on and will modify this list as required. We also know we have you, dear readers, to check our assumptions and give us feedback on the list. If you have or know of a firm we should consider for this, let us know.



Read the original blog entry...

More Stories By Bob Gourley

Bob Gourley, former CTO of the Defense Intelligence Agency (DIA), is Founder and CTO of Crucial Point LLC, a technology research and advisory firm providing fact based technology reviews in support of venture capital, private equity and emerging technology firms. He has extensive industry experience in intelligence and security and was awarded an intelligence community meritorious achievement award by AFCEA in 2008, and has also been recognized as an Infoworld Top 25 CTO and as one of the most fascinating communicators in Government IT by GovFresh.

@ThingsExpo Stories
The buzz continues for cloud, data analytics and the Internet of Things (IoT) and their collective impact across all industries. But a new conversation is emerging - how do companies use industry disruption and technology enablers to lead in markets undergoing change, uncertainty and ambiguity? Organizations of all sizes need to evolve and transform, often under massive pressure, as industry lines blur and merge and traditional business models are assaulted and turned upside down. In this new data-driven world, marketplaces reign supreme while interoperability, APIs and applications deliver un...
The Internet of Things (IoT) is growing rapidly by extending current technologies, products and networks. By 2020, Cisco estimates there will be 50 billion connected devices. Gartner has forecast revenues of over $300 billion, just to IoT suppliers. Now is the time to figure out how you’ll make money – not just create innovative products. With hundreds of new products and companies jumping into the IoT fray every month, there’s no shortage of innovation. Despite this, McKinsey/VisionMobile data shows "less than 10 percent of IoT developers are making enough to support a reasonably sized team....
Electric power utilities face relentless pressure on their financial performance, and reducing distribution grid losses is one of the last untapped opportunities to meet their business goals. Combining IoT-enabled sensors and cloud-based data analytics, utilities now are able to find, quantify and reduce losses faster – and with a smaller IT footprint. Solutions exist using Internet-enabled sensors deployed temporarily at strategic locations within the distribution grid to measure actual line loads.
Too often with compelling new technologies market participants become overly enamored with that attractiveness of the technology and neglect underlying business drivers. This tendency, what some call the “newest shiny object syndrome,” is understandable given that virtually all of us are heavily engaged in technology. But it is also mistaken. Without concrete business cases driving its deployment, IoT, like many other technologies before it, will fade into obscurity.
You have your devices and your data, but what about the rest of your Internet of Things story? Two popular classes of technologies that nicely handle the Big Data analytics for Internet of Things are Apache Hadoop and NoSQL. Hadoop is designed for parallelizing analytical work across many servers and is ideal for the massive data volumes you create with IoT devices. NoSQL databases such as Apache HBase are ideal for storing and retrieving IoT data as “time series data.”
Today air travel is a minefield of delays, hassles and customer disappointment. Airlines struggle to revitalize the experience. GE and M2Mi will demonstrate practical examples of how IoT solutions are helping airlines bring back personalization, reduce trip time and improve reliability. In their session at @ThingsExpo, Shyam Varan Nath, Principal Architect with GE, and Dr. Sarah Cooper, M2Mi's VP Business Development and Engineering, will explore the IoT cloud-based platform technologies driving this change including privacy controls, data transparency and integration of real time context w...
The Internet of Everything is re-shaping technology trends–moving away from “request/response” architecture to an “always-on” Streaming Web where data is in constant motion and secure, reliable communication is an absolute necessity. As more and more THINGS go online, the challenges that developers will need to address will only increase exponentially. In his session at @ThingsExpo, Todd Greene, Founder & CEO of PubNub, will explore the current state of IoT connectivity and review key trends and technology requirements that will drive the Internet of Things from hype to reality.
The IoT market is on track to hit $7.1 trillion in 2020. The reality is that only a handful of companies are ready for this massive demand. There are a lot of barriers, paint points, traps, and hidden roadblocks. How can we deal with these issues and challenges? The paradigm has changed. Old-style ad-hoc trial-and-error ways will certainly lead you to the dead end. What is mandatory is an overarching and adaptive approach to effectively handle the rapid changes and exponential growth.
Today’s connected world is moving from devices towards things, what this means is that by using increasingly low cost sensors embedded in devices we can create many new use cases. These span across use cases in cities, vehicles, home, offices, factories, retail environments, worksites, health, logistics, and health. These use cases rely on ubiquitous connectivity and generate massive amounts of data at scale. These technologies enable new business opportunities, ways to optimize and automate, along with new ways to engage with users.
The IoT is upon us, but today’s databases, built on 30-year-old math, require multiple platforms to create a single solution. Data demands of the IoT require Big Data systems that can handle ingest, transactions and analytics concurrently adapting to varied situations as they occur, with speed at scale. In his session at @ThingsExpo, Chad Jones, chief strategy officer at Deep Information Sciences, will look differently at IoT data so enterprises can fully leverage their IoT potential. He’ll share tips on how to speed up business initiatives, harness Big Data and remain one step ahead by apply...
There will be 20 billion IoT devices connected to the Internet soon. What if we could control these devices with our voice, mind, or gestures? What if we could teach these devices how to talk to each other? What if these devices could learn how to interact with us (and each other) to make our lives better? What if Jarvis was real? How can I gain these super powers? In his session at 17th Cloud Expo, Chris Matthieu, co-founder and CTO of Octoblu, will show you!
SYS-CON Events announced today that ProfitBricks, the provider of painless cloud infrastructure, will exhibit at SYS-CON's 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. ProfitBricks is the IaaS provider that offers a painless cloud experience for all IT users, with no learning curve. ProfitBricks boasts flexible cloud servers and networking, an integrated Data Center Designer tool for visual control over the cloud and the best price/performance value available. ProfitBricks was named one of the coolest Clo...
As a company adopts a DevOps approach to software development, what are key things that both the Dev and Ops side of the business must keep in mind to ensure effective continuous delivery? In his session at DevOps Summit, Mark Hydar, Head of DevOps, Ericsson TV Platforms, will share best practices and provide helpful tips for Ops teams to adopt an open line of communication with the development side of the house to ensure success between the two sides.
SYS-CON Events announced today that IBM Cloud Data Services has been named “Bronze Sponsor” of SYS-CON's 17th Cloud Expo, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. IBM Cloud Data Services offers a portfolio of integrated, best-of-breed cloud data services for developers focused on mobile computing and analytics use cases.
SYS-CON Events announced today that Sandy Carter, IBM General Manager Cloud Ecosystem and Developers, and a Social Business Evangelist, will keynote at the 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA.
Developing software for the Internet of Things (IoT) comes with its own set of challenges. Security, privacy, and unified standards are a few key issues. In addition, each IoT product is comprised of at least three separate application components: the software embedded in the device, the backend big-data service, and the mobile application for the end user's controls. Each component is developed by a different team, using different technologies and practices, and deployed to a different stack/target - this makes the integration of these separate pipelines and the coordination of software upd...
Mobile messaging has been a popular communication channel for more than 20 years. Finnish engineer Matti Makkonen invented the idea for SMS (Short Message Service) in 1984, making his vision a reality on December 3, 1992 by sending the first message ("Happy Christmas") from a PC to a cell phone. Since then, the technology has evolved immensely, from both a technology standpoint, and in our everyday uses for it. Originally used for person-to-person (P2P) communication, i.e., Sally sends a text message to Betty – mobile messaging now offers tremendous value to businesses for customer and empl...
"Matrix is an ambitious open standard and implementation that's set up to break down the fragmentation problems that exist in IP messaging and VoIP communication," explained John Woolf, Technical Evangelist at Matrix, in this interview at @ThingsExpo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
WebRTC converts the entire network into a ubiquitous communications cloud thereby connecting anytime, anywhere through any point. In his session at WebRTC Summit,, Mark Castleman, EIR at Bell Labs and Head of Future X Labs, will discuss how the transformational nature of communications is achieved through the democratizing force of WebRTC. WebRTC is doing for voice what HTML did for web content.
Nowadays, a large number of sensors and devices are connected to the network. Leading-edge IoT technologies integrate various types of sensor data to create a new value for several business decision scenarios. The transparent cloud is a model of a new IoT emergence service platform. Many service providers store and access various types of sensor data in order to create and find out new business values by integrating such data.