Data Profiling
From MIKE2.0 Methodology
Activity: Data Profiling
Objective
Data Profiling focuses on conducting an assessment of actual data and data structures. It helps provide the following:
The purpose of this phase is to provide objective, information-based results around information analysis.
Major Deliverables
Tasks
Prepare for Assessment
Objective: This task ensures that the profiling environment is ready and that the scope of information to be investigated is agreed upon within the team and signed off by the client. This may be challenging because some of the other requirements may still be somewhat undefined at this stage. Minimising delays in obtaining the necessary extract files is a critical dependency and risk area during this task. Because profiling requires production extracts, the lead time for procuring data may be significant.
Perform Column Profiling
Objective: This task involves profiling the data found in a single column/field in either a table or a flat file. It involves analysis of simple and complex fields. Each task is done on a per-system (or subset of a system) basis. Key steps to column profiling include:
Simple and Complex field profiling may be split into separate tasks.
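As a rough illustration of what profiling a single column can look like, the sketch below computes a few common measures for one field. The choice of measures (null count, distinct count, range, top values and character patterns) and the function name `profile_column` are assumptions made for this example; they are not the methodology's prescribed steps or the output of any particular profiling tool.

```python
from collections import Counter


def profile_column(values):
    """Summarise one column: completeness, cardinality, range and value patterns."""
    # Treat None and blank strings as missing (an assumption for this sketch).
    non_null = [v for v in values if v is not None and str(v).strip() != ""]
    freq = Counter(non_null)

    def pattern(value):
        # Map digits to 9 and letters to A so that format variants stand out.
        return "".join(
            "9" if ch.isdigit() else "A" if ch.isalpha() else ch
            for ch in str(value)
        )

    patterns = Counter(pattern(v) for v in non_null)
    return {
        "row_count": len(values),
        "null_count": len(values) - len(non_null),
        "distinct_count": len(freq),
        "min": min(non_null, default=None),
        "max": max(non_null, default=None),
        "top_values": freq.most_common(3),
        "patterns": patterns.most_common(),
    }


# Hypothetical postcode column with two missing values and one typo (letter O).
postcodes = ["2000", "3000", "", "20O0", None, "2000"]
report = profile_column(postcodes)
```

In this sample the pattern counts separate the well-formed four-digit values from the single entry that contains a letter, which is the kind of finding a column profile is meant to surface for review.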
Output:
Perform Table Profiling
Objective: This task involves analysing data across rows of a single table to establish dependencies between attributes within each table. Each task is done on a per-system (or subset of a system) basis. Key steps to table profiling include:
On completion, further analysis may be planned based on results.
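One way to test for a dependency between two attributes of a table is to check whether each value of one column always appears with the same value of another. The sketch below does this for a small in-memory table; the function name `dependency_violations` and the sample rows are hypothetical and serve only to illustrate the idea.

```python
from collections import defaultdict


def dependency_violations(rows, determinant, dependent):
    """Return determinant values that map to more than one dependent value."""
    seen = defaultdict(set)
    for row in rows:
        seen[row[determinant]].add(row[dependent])
    # A dependency determinant -> dependent holds only if every set has one member.
    return {key: sorted(vals) for key, vals in seen.items() if len(vals) > 1}


# Hypothetical customer table: postcode is expected to determine city.
customers = [
    {"id": 1, "postcode": "2000", "city": "Sydney"},
    {"id": 2, "postcode": "3000", "city": "Melbourne"},
    {"id": 3, "postcode": "2000", "city": "Sydney"},
    {"id": 4, "postcode": "3000", "city": "Melborne"},
]
violations = dependency_violations(customers, "postcode", "city")
key_check = dependency_violations(customers, "id", "postcode")
```

An empty result means the dependency holds for every row in the extract; a non-empty result lists the values to investigate, here a misspelt city name.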
Perform Multi-Table Profiling
Objective: This task involves analysing data across tables to look for redundancy and referential integrity issues. Tasks may be done on a per-system basis, on a sub-system basis or across systems. Key steps to multi-table profiling include:
Input:
Output:
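The multi-table checks described in this task can be sketched as two small set comparisons: one that finds child keys with no matching parent (a referential integrity issue) and one that measures how much of a column's values also appear in another table's column (a hint of redundancy). The function names, the overlap measure and the sample tables below are assumptions for illustration only.

```python
def orphaned_keys(child_rows, child_key, parent_rows, parent_key):
    """Return child key values that have no matching row in the parent table."""
    parent_values = {row[parent_key] for row in parent_rows}
    child_values = {
        row[child_key] for row in child_rows if row[child_key] is not None
    }
    return sorted(child_values - parent_values)


def value_overlap(rows_a, col_a, rows_b, col_b):
    """Fraction of distinct non-null values in col_a that also occur in col_b."""
    a = {row[col_a] for row in rows_a if row[col_a] is not None}
    b = {row[col_b] for row in rows_b if row[col_b] is not None}
    if not a:
        return 0.0
    return len(a & b) / len(a)


# Hypothetical parent and child tables.
customers = [{"id": 1}, {"id": 2}, {"id": 3}]
orders = [
    {"order_id": 10, "customer_id": 1},
    {"order_id": 11, "customer_id": 1},
    {"order_id": 12, "customer_id": 4},
    {"order_id": 13, "customer_id": None},
]
orphans = orphaned_keys(orders, "customer_id", customers, "id")
overlap = value_overlap(orders, "customer_id", customers, "id")
```

Null foreign keys are ignored in both checks; whether a null should itself count as an integrity issue depends on the rules agreed for the system being profiled.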
Finalise Data Quality Report
Objective: This step completes the Data Quality Assessment Report and issues it for signoff. Sections of the Data Quality Assessment Report and the metadata repository should be populated throughout the end-to-end profiling exercise. This step completes the remaining sections and makes a final recommendation on whether the data should be loaded into the target system. On completion, there should be a formal walkthrough, review and final signoff.
Input:
Core Supporting Assets
Yellow Flags
Key Resource Requirements

