Talend Simplifies Big Data Further with New Release of Enterprise Open Source Integration Platform

Open source software leader advances next-generation integration solution with big data profiling for Hadoop, support of major NoSQL databases and increased usability features

Maidenhead, UK - 5 November 2012 - Talend, a global open source software leader, today announced the availability of version 5.2 of its next-generation integration platform, the only offering that provides a unified environment for managing the entire lifecycle across data, application and process integration requirements. With version 5.2, Talend extends the industry's most flexible, scalable and adaptive integration platform with the introduction of key new capabilities, including big data profiling for Hadoop, support for widely-used and deployed NoSQL databases, and a set of improvements that increases product usability and performance across the entire platform.

Big Data Profiling
In its mission to democratise big data, Talend has focused extensively on solutions that make deploying and managing Apache Hadoop and related technologies simple, without requiring specific expertise in these areas. With version 5.2, Talend has taken its big data strategy a step further by adding big data profiling for Hadoop, providing companies with the ability to discover and understand data in Hadoop clusters. Among the typical problems associated with data quality are duplication, incompleteness and inconsistency, which create inefficiencies in data processing. Talend Platform for Big Data includes new capabilities for visibility into big data in all its forms and locations. These include the ability to analyse data in Hive databases on Hadoop "in place" without extraction and the ability to perform data hygiene tasks, including data cleansing, enrichment, matching and de-duplication directly inside the Hadoop cluster through Hadoop code generation.

Simplified NoSQL Integration with Hadoop
Talend 5.2 adds support for NoSQL databases in its integration solutions, Talend Platform for Big Data and Talend Open Studio for Big Data, with an initial set of connectors for Cassandra, HBase and MongoDB. Built on Talend's award-winning open source integration technology, Talend Open Studio for Big Data is a powerful and versatile open source solution for big data integration that natively supports Apache Hadoop, including connectors for Hadoop Distributed File System (HDFS), HCatalog, Hive, Oozie, Pig and Sqoop - in addition to the more than 450 connectors included natively in the product. As NoSQL has become the go-to technology for certain data architectures, the integration of these platforms into Talend's big data solution enables customers to use these new connectors to migrate and synchronise data between NoSQL databases and all other data stores and systems.

"Talend version 5.2 delivers on our vision of simplifying the development, integration and management of big data so that businesses can focus on using that data to make faster and more informed decisions," said Fabrice Bonan, co-founder and chief technical officer, Talend. "We provide the most powerful and versatile open source, big data solution to help organisations load, extract and improve disparate data while leveraging the massively parallel processing power of big data technologies including Apache Hadoop and leading NoSQL databases."

Latest Release of Talend's Integration Products
In addition to its big data enhancements, Talend introduces version 5.2 of its flagship data integration products, which leverage the Talend Unified Platform. New features focus on product usability, user productivity and performance to provide a more robust and easier-to-use solution.

  • Talend Enterprise Data Integration - In v5.2, parallel execution of jobs can now leverage multi-core hardware. This new version also supports continuous integration between development, test and production environments and is integrated with open source build manager Maven.

  • Talend Enterprise Data Quality - Version 5.2 includes expanded address validation algorithms, precise e-mail validity detection, and native fraud detection capabilities. A new set of components allows customers to use Melissa Data to validate addresses.

  • Talend Enterprise MDM - Support for a wider range of enterprise architectures in v5.2 lowers the barrier to MDM adoption; organisations can now use their Oracle, MySQL, Derby or H2 databases as the underlying MDM data store.

  • Talend Enterprise ESB - In this version, continuous integration between development, test and production environments is now available. The Nexus repository manager is also supported for versioning and deployment.

  • Talend Enterprise BPM - Talend v5.2 fully integrates a BPM engine into the Talend Runtime. Talend users only need to manage a single container, which can run data jobs, web services, REST applications and now BPM processes. With fewer moving parts in system environments and the flexibility to run multiple instances of different application types within the same container, the IT administrator's software management and maintenance workload is significantly reduced.

Version 5.2 of Talend Open Studio for Data Integration, Talend Open Studio for Data Quality, Talend Open Studio for MDM, Talend Open Studio for ESB and Talend Open Studio for Big Data is available for immediate download from Talend's web site. Version 5.2 of the commercial subscription products, available before the end of 2012, will be provided to all existing Talend customers as part of their subscription agreement and can be procured through the usual Talend representatives or partners.

About Talend
Talend is the recognised market leader in open source integration solutions. The company's enterprise integration platform helps organisations minimise costs and maximise the value of data integration, ETL, data quality, master data management, application integration and business process management, while supporting their shift toward the Cloud and Big Data. More than 3,500 paying customers worldwide, including eBay, ING, The Weather Channel, Deutsche Post and Allianz, subscribe to Talend's solutions and services. With over 20 million downloads, Talend's products are the most trusted integration solutions in the world. The company has major offices in North America, Europe and Asia, and a global network of technical and services partners. For more information, please visit


PR Contacts:
Selene Regan
[email protected]

Tom Webb
01252 727313
[email protected]

More Stories By RealWire News Distribution

RealWire is a global news release distribution service specialising in online media. The RealWire approach focuses on delivering relevant content to the receivers of our clients' news releases, as we know that it is only through delivering relevance that influence can ever be achieved.
