

Talend Reinforces Leadership in Big Data Integration with Support for YARN

Latest Version of Enterprise Integration Platform Fully Embraces Hadoop as a Computing Platform

Talend, a global open source software leader, today announced the availability of version 5.4 of its next-generation integration platform, natively optimised to run inside Hadoop by leveraging MapReduce 2.0, also known as YARN. Setting itself apart as the only integration solution running inside Hadoop, version 5.4 of Talend for Big Data enables data-driven organisations to capitalise on all data assets in ways never thought possible, thanks to the power of open source-driven innovation.

"At the forefront of a big data paradigm shift, Talend invests heavily in building the integration platform of tomorrow, leveraging the benefits of open source for enterprise clients," said Fabrice Bonan, CTO and co-founder of Talend. "With the advent of YARN, Hadoop is truly becoming a computing platform that goes well beyond its early use cases. With Talend v5.4, we are providing customers with the tools they need to unleash the power of Hadoop to fully leverage their total data and use it as a strategic asset, for any type of value-added project or application."

"VHA is using Talend's platform to aggregate, reconcile, cleanse and deliver large and complex data sets from extremely diverse sources," said Lloyd Mangnall, vice president, architecture and quality at VHA. "Talend's ability to take full advantage of modern data architectures to master and transform data 'in place' is truly ground-breaking. We are looking forward to using the new data quality features running natively inside our Hadoop and NoSQL clusters, fully leveraging big data systems to support master data management."

With version 5.4, Talend continues to break ground by providing capabilities that accelerate big data projects and increase their value:

  • YARN as the Engine: Apache Hadoop has introduced some major innovations, including YARN (also known as MapReduce 2.0). With version 5.4, Talend enables the native use of YARN as the data integration engine, allowing customers to benefit immediately from the resource management that YARN offers. Unlike third-party engines that sit on top of Hadoop, Talend v5.4 lets users benefit directly from the scalability and elasticity built into the Hadoop platform, with no additional overhead in scheduling, job management, or platform deployment and maintenance.
  • Visual Optimisation of MapReduce Jobs: With version 5.4, Talend is the only vendor that enables developers to determine the optimal way to design a MapReduce job before they run it in production. Statistics and indicators are available directly in the design environment to guide developers through the optimisation process, which is performed against a sample of the actual data set.
  • Hadoop for Data Cleansing: Building on its commitment to data quality as critical to data integration, Talend v5.4 generates big data quality tasks as MapReduce jobs, making it possible to run computationally intensive data quality tasks at the scale of the Hadoop cluster. In addition to the profiling sources already supported, which include Hadoop, Hive and all major relational databases, Talend v5.4 adds profiling for HP Vertica and exposes the profiling connection information so that additional connections can be built by the Community.
  • Support for Hadoop Security with Kerberos: Organisations seeking to adopt big data are facing new challenges of management, governance and security. To address this, Talend v5.4 adds support for Kerberos, the number one security framework for Hadoop, making Hadoop deployments more secure.

Commitment to the Talend Partner Ecosystem
In addition to supporting the latest platforms from partners such as Hortonworks, Cloudera and MapR, Talend is also extending support for SAP HANA, Pivotal HD and IBM PureData System for Hadoop.

Talend also continues to enhance its NoSQL support with batch data loads to MongoDB and Cassandra, as well as new connectors for Riak, further underscoring the company's leadership in NoSQL and big data integration.

"Talend has a strong track record of world-class innovation in the data integration market," said William McKnight, president of McKnight Consulting Group. "The ability of Talend's solutions to evolve within a quickly changing technology landscape enables organisations to remove integration barriers to the adoption of modern data platforms such as NoSQL or Hadoop. The support of YARN as the integration engine, and the Hadoop visual job optimisation features introduced in Talend v5.4, will provide even more agility and choices for users."

Version 5.4 of Talend Open Studio for Data Integration, Talend Open Studio for Data Quality, Talend Open Studio for MDM, Talend Open Studio for ESB and Talend Open Studio for Big Data is available for immediate download from Talend's website, www.talend.com. Version 5.4 of the commercial subscription products will be available within six weeks. It will be provided to all existing Talend customers as part of their subscription agreement and can be procured through the usual Talend representatives or partners.

About Talend
From small projects to enterprise-wide implementations, Talend's highly scalable data, application and business process integration platform maximises the value of an organisation's information assets and optimises return on investment through a usage-based subscription model. Ready for big data environments, Talend's flexible architecture easily adapts to future IT platforms. A common set of easy-to-use tools implemented across all Talend products also enables teams to scale developer skillsets.

More than 4,000 enterprise customers worldwide leverage Talend's solutions and services. The company has major offices in North America, Europe and Asia, and a global network of technical and services partners. For more information, please visit www.talend.com.

Source: RealWire


More Stories By RealWire News Distribution

RealWire is a global news release distribution service specialising in online media. The RealWire approach focuses on delivering relevant content to the receivers of our clients' news releases, as we know that it is only through delivering relevance that influence can ever be achieved.
