Data governance requires control mechanisms and procedures for, but not limited to, assignment and tracking of action items.
Preparation and pre-processing of historical data needed in a predictive model may be performed in nightly batch processes or in near real-time.
The flow of data in a data integration solution does not have to be designed and documented.
The operational data quality management procedures depend on the ability to measure and monitor the applicability of data.
A goal of reference and master data is to provide authoritative source of reconciled and quality-assessed master and reference data.
Customer relationship management systems manage Master Data about customers.
In gathering requirements for DW/BI projects, begin with the data goals and strategies first.
Lack of automated monitoring represents serious risks, including compliance risk.
Data modelling tools are software that automate many of the tasks the data modeller performs.
Effective data management involves a set of complex, interrelated processes that enable an organisation to use its data to achieve strategic goals.
Malware refers to any infectious software created to damage, change or improperly access a computer or network.
Subtype absorption: The subtype entity attributes are included as nullable columns into a table representing the supertype entity
Technical Metadata provides data about the technical data, the systems that store data, and the processes that move between systems.
A roadmap for enterprise data architecture describes the architecture’s 3 to 5-year development path. The roadmap should be guided by a data management maturity assessment.
Data profiling also includes cross-column analysis, which can identify overlapping or duplicate columns and expose embedded value dependencies.
Small reference data value sets in the logical data model can be implemented in a physical model in three common ways:
Those responsible for the data-sharing environment have an obligation to downstream data consumers to provide high quality data.
Examples of concepts that can be standardized within the data quality knowledge area include:
Business activity information is one of the types of data that can be modelled.
Controlling data availability requires management of user entitlements and of structures that technically control access based on entitlements.
Data quality rules and standards are a critical form of Metadata. Ti be effective they need to be managed as Metadata. Rules include:
All organizations have the same Master Data Management Drivers and obstacles.
The IBM Data Governance Council model is organized around four key categories. Select the answer that is not a category.
Data warehousing describes the operational extract, cleaning, transformation, control and load processes that maintain the data in a data warehouse.
To mitigate risks, implement a network-based audit appliance, which can address most of the weaknesses associated with the native audit tools. This kind of appliance has the following benefits:
Product Master data can only focus on an organization’s internal product and services.
Improving data quality requires a strategy that accounts for the work that needs to be done and the way people will execute it.
Effective data management involves a set of complex, interrelated processes that disable an organization to use its data to achieve strategic goals.
When constructing models and diagrams during formalisation of data architecture there are certain characteristics that minimise distractions and maximize useful information. Characteristics include:
Please select the correct general cost and benefit categories that can be applied consistently within an organization.
A synonym for transformation in ETL is mapping. Mapping is the process of developing the lookup matrix from source to target structures, but not the result of the process.
Validity, as a dimension of data quality, refers to whether data values are consistent with a defined domain of values.
The implementation of a Data Warehouse should follow these guiding principles:
Modeling Bid data is a non-technical challenge but critical if an organization that want to describe and govern its data.
Elements that point to differences between warehouses and operational systems include:
A goal of data governance is to enable an organisation to manage its data as a liability.
Field overloading: Unnecessary data duplication is often a result of poor data management.
A limitation of the centralized approach include: Maintenance of a decentralized repository is costly.
A control activity in the metadata management environment includes loading statistical analysis.
If the target system has more transformation capability than either the source or the intermediary application system, the order of processes may be switched to ELT – Extract Load Tranform.
An image processing system captures, transforms and manages images of paper and electronic documents.
Several global regulations have significant implications on data management practices. Examples include: