Talend, a global open source software leader, today announced the availability of version 5.4 of its next-generation integration platform, natively optimised to run inside Hadoop by leveraging MapReduce 2.0, also known as YARN. Setting itself apart as the only integration solution running inside Hadoop, version 5.4 of Talend for Big Data enables data-driven organisations to capitalise on all data assets in ways never thought possible, thanks to the power of open source-driven innovation.
“At the forefront of a big data paradigm shift, Talend invests heavily in building the integration platform of tomorrow, leveraging the benefits of open source for enterprise clients,” said Fabrice Bonan, CTO and co-founder of Talend. “With the advent of YARN, Hadoop is truly becoming a computing platform that goes well beyond its early use cases. With Talend v5.4, we are providing customers with the tools they need to unleash the power of Hadoop to fully leverage their total data and use it as a strategic asset, for any type of value-added project or application.”
"VHA is using Talend's platform to aggregate, reconcile, cleanse and deliver large and complex data sets from extremely diverse sources," said Lloyd Mangnall, vice president, architecture and quality at VHA. "Talend's ability to take full advantage of modern data architectures to master and transform data 'in place' is truly ground-breaking. We are looking forward to using the new data quality features running natively inside our Hadoop and NoSQL clusters, fully leveraging big data systems to support master data management."
With version 5.4, Talend continues to break ground by providing unique and unmatched capabilities to accelerate and increase value of big data projects:
- YARN as the Engine: Apache Hadoop has introduced some major innovations, including YARN (also known as MapReduce 2.0). With version 5.4, Talend enables the native use of YARN as the data integration engine, allowing customers to immediately benefit from the superior resource management that YARN offers. Unlike 3rd party engines that sit on top of Hadoop, Talend v5.4 users directly benefit from the massive scalability and elasticity built into the Hadoop platform with zero overhead in scheduling, job management or platform deployment & maintenance.
- Visual Optimisation of MapReduce Jobs: With version 5.4, Talend is the only vendor that enables developers to determine the most optimal way to design a MapReduce job before they run it in production. Statistics and indicators are directly available in the design environment to guide developers through the optimisation process that is performed against a sample of the actual data set.
- Hadoop for Data Cleansing: Building on its commitment to data quality as being critical to data integration, Talend v5.4 supports big data quality generated as MapReduce jobs, making it possible for computationally intensive data quality tasks to be performed at an infinite scale. In addition to the already supported profiling sources which included Hadoop, Hadoop Hive and all major relational databases, Talend v5.4 adds profiling for HP Vertica and exposes the profiling connection information so that additional connections can be built by the Community.
- Support for Hadoop Security with Kerberos: Organisations seeking to adopt big data are facing new challenges of management, government and security. To address this, Talend v5.4 adds support for Kerberos, the number one security framework for Hadoop, making Hadoop distributions more secure.
Service Delivery Executive
Tel: +44 (0)1522 883640