From MIKE2.0 Methodology
||This article is currently Under Construction. It is undergoing major changes as it is in the early stages of development. Users should help contribute to this article to get it to the point where is ready for a Peer Review.
Talend Data Integration, also known as just Octopus is an Open Source ETL tool.
Talend Company Overview
Relationship to MIKE2.0
- Used to deliver the ETL sub-component
Overall Implementation Guide
ETL Design and Implementation Activities of MIKE2.0, which are:
Alignment with Strategic Requirements for Infrastructure Development
- Platform: runs on Windows, Unix and Linux.
- GUI: drag and drop Designer style interface for components, connectors and relationships.
- Code: Generates components as Perl, Java or native SQL code for an ETL and/or ELT transformation approach.
- Source: The Java/Eclipse source code is available for download and customisation.
- Support: A social online community with Talend's wiki, the Talend Forum and a bugtracker.
- Connectivity: Native connectivity stages for Oracle, DB2, MySQL, Sybase and Postgres. ODBC connectivity for other databases.
- Scalability: Supports grid processing and a combination of ETL and ELT for leveraging processing capability of the architecture.
- With the ELT stage it can balance the load between the Talend processing server, cluster or grid and the source, staging or target databases.
- Being open sources makes it suitable as an embedded data integration tool in a complementary product such as business intelligence, data quality or product upgrade. Talend is in use in the JasperSoft Business Intelligence software.
- The community provides a place to get product support and the forum provides free support. While this forum comes without any service guarantees it is monitored by Talend company staff to maintain the quality of the product.
- The GUI ETL interface for import of metadata, configuration and linking of components and generation of code will provide productivity gains for developers and will be faster than standard programming.
- Access to source requires a JDBC driver.
- The ongoing development of Talend is mostly driven by the parent company and funded by a private equity firm and a venture capital firm. There is a risk the company could fail to make premium support services profitable and lose the support of the financial backers. This would remove support options and greatly reduce the ongoing enhancement and innovation in the product and leave it at risk of becoming incompatible with new versions of databases. Since this company is more than two years old and has a large number of users this risk is currently low.
- There are no complementary profiling, data quality or metadata management products. These roles can be fulfilled by third part products but there are few open source options currently available.
- The product can allocate resources across a grid but does not have the underlying automation of data partitioning and repartitioning and this could result in bottleneck jobs.
Functonality that users of the MIKE2.0 Methodology would like to see added to this product are as follows:
User Valuation Enhancements
Voting scores from MIKE2.0 Contributors on the value of the asset in the context of the overall methodology
Talend data Integration download available from product website
Open Source Licensing
Comparable Open Source Products
Reference Implementations through MIKE2.0