COMPUTER TRICKS, TWEAKS AND TUTORIALS: CS614 – MF-2010 Paper 2

CS614 – MF-2010 Paper 2



Which of the following statements is true? 1 GB is
2^30 or 10^9 bytes
2^30 or 10^6 bytes
2^32 or 10^9 bytes
2^32 or 10^8 bytes

Normally, the selectivity of a query in a data warehouse is
High
Low
Not measured

Normalization is the process of efficiently organizing data in a database by ________ a relational table into smaller tables by projection.
Composing
Joining / Merging
Combining
Decomposing

Fourth normal form (4NF) has an additional requirement, which is
Data is in 2NF and there is no Multi-valued dependency
Data is in 3NF and there is no foreign key in tables
Data is in 3NF and there is no Multi-valued dependency
Data is in 3NF and there is no NULL key dependency

The most common use of range partitioning in a data warehouse is on
Date
Most redundant column
Fact
Dimensions

One of the OLAP characteristics is Multi-dimensional, which is ________ for OLAP.
Essential
Optional
Discretionary
Not Obligatory

Dimensional modeling techniques focus on the concepts of _______ and _____
None of these
Facts, Dimensions
Facts, hierarchy
A central table, dimensional tables

The goal of star schema design is to simplify ________
Logical data model
Physical data model
Conceptual data model
None of these

Single value attributes during recording of a transaction are __________
Dimensions
Facts
Aggregates

A ________ dimension is a collection of random transactional codes, flags and/or text attributes that are unrelated to any particular dimension. The ______ dimension is simply a structure that provides a convenient place to store the ______ attributes.
Junk
Time
Parallel
None of these

Full and Incremental extraction techniques are types of ____________
Logical Extraction
Physical Extraction
Both Logical and Physical Extraction
None of these

Once the data has been transformed and is ready to be loaded into the data warehouse, we adopt one of two prevalent ________ strategies.
Loading
Transformation
Quality
Indexing

Lexical errors fall into which class of anomalies?
Syntactically Dirty Data
Semantically Dirty Data
Coverage Anomalies
Missing Values Anomalies

Within the data warehousing field, data ________ is applied especially when several databases are merged.
Extraction
Loading
Cleansing
Join

All data is ______________ of something real.

I. An Abstraction
II. A Representation

Which of the following options is true?
  • I Only
  • II Only
  • Both I & II
  • None of I & II

This form is useful for longitudinal comparisons illustrating trends of continuous improvement. Many traditional data quality metrics, such as free-of-error, completeness, and consistency, take this form. This statement is about which of the following?
Simple Ratio
Min Operation
Max Operation
Weighted Average

A single database could not serve both operational high-performance transaction processing and DSS analytical processing at the same time.
True
False

________ gives total view of an organization
OLTP
Data warehouse
Database
OLAP

In _________ system, the contents change with time.
OLTP
DSS
ATM
OLAP

Redundancy causes _________ anomalies
Update
Select
Both Update & Select
None of these

Briefly describe the features of DOLAP. (2 marks)

What is meant by Dirty Data? (2 marks)

What do we mean by subjective and objective data quality assessments? (3 marks)

How can we handle the situation where we want to use the old value both before and after the change? (3 marks)

What is the effect of space requirement on ROLAP? (5 marks)

Give the formal and informal definitions of quality, and explain the term with a real-life example. Also explain the difference between Intrinsic Data Quality and Realistic Data Quality. (5 marks)