|
Wiki Home
Members
To join, please contact us. Improve MIKE 2.0
Need somewhere to start? How about the most wanted pages; or the pages we know need more work; or even the stub that somebody else has started, but hasn't been able to finish. Or create a ticket for any issues you have found.
|
Data Re-EngineeringFrom MIKE2.0 Methodology -> You are here: Agile Business Transformation > E-Discovery Solution Offering > Architecture Patterns for Data Synchronisation > Enterprise Knowledge Market > Data Re-Engineering
Activity: Data Re-EngineeringObjectiveData Re-Engineering is a term used to describe a number of related functions for standardising data to a common format, correcting data quality issues, removing duplicate information/building linkages between records that did not exist previously, or enriching data with supplementary information. The MIKE2.0 approach for Data Re-Engineering derives some of its ideas and language from the TIQM Methodology proposed by Larry English [1] for Data Re-Engineering. TIQM provides a very comprehensive approach to preventing and resolving Data Quality issues and is recommended as a complementary reference guide for users of MIKE2.0. Data Re-Engineering is not always conducted within Phase 3 of the MIKE2.0 Methodology, it is an activity that may be conducted throughout the process. In many cases, however, it does make sense to try and address the major data issues before moving data into the target environment. Major Deliverables
TasksPrepare for Data Re-EngineeringObjective: In this task, the team prepares for Data Re-Engineering by ensuring that at least high level information requirements have been established, data extracts are available and the software development environment is ready. Data Profiling is typically a pre-requisite to this task, as it helps to quantitatively understand the data quality issues that exist beforehand and plan appropriately. Generally the same process for acquiring extracts for can be followed for Data Investigation.
Perform Data StandardisationObjective: Data Standardisation addresses problems related to:
The Data standardisation process is used to get data into an agreed-to atomic form, oftentimes mapping in data from complex fields using a vendor tool. Mapping rules from the standardisation processes are ideally fed into a metadata repository.
Perform Data CorrectionObjective: Data Correction typically addresses problems related to:
Perform Data Matching and ConsolidationObjective: In this task, data is associated with other records to identify matching sets. Matching records can then either be consolidated to remove duplications or linked to another to form new associations.
Perform Data EnrichmentObjective: Data Enrichment typically refers to the supplementing on an organisation’s internal data with data from external sources. Types of data that is typically used for enrichment data:
This provides an overall more robust set of information to make key business decisions Input:
Finalise Business Summary of Data Quality ImpactObjective: This task provides a summary of the root causes of the data quality issues that impact the business and recommendations for resolution of these issues. Recommendations may involve changes to source systems, improvements to business processes, increased validation, etc. The report produced should also involve financials to build a business case around resolution of these data quality issues and whether it makes sense to address them from a business perspective. Input:
Output: Core Supporting Assets
Yellow Flags
Key Resource Requirements
Potential Changes to this ActivityA few tasks may need to be added to this activity that are more commonly applied for Search. In particular: lemmatisation and spell checking that are commonly applied through a human interface. Alternatively, may expand the definition of standardisation to cover this area. References
|
Wiki asset search
Toolbox
Views
Wiki Contributors
|

