1
Data Warehousing Questions 1. What is a Data Mart? a) Database's used by a single business analyst b) Database's used by the whole business organization c) Scaled down version of a Data Warehouse usually developed to solve a particular business problem d) A "view" of the Data Warehouse created within the database management system What is Data Webhouse? a) An Active Data Warehouse b) A Data Warehouse which has the data feed from the Internet log and related data, and the information from the Data Warehouse web enabled c) A collection Data Marts inter-connected as a Web d) A Operational Data Store, Data Mart and Data Warehouse is called a Data Webhouse Data Mart is a) Single Subject Oriented Data Warehouse b) A collection of Data Warehouse c) An Application on a Data Warehouse d) None of the Above Which one of the following is NOT an example of Operational Systems? a) Order Tracking Applications, such as catalog sales b) Customer service applications, such as setting up customer accounts c) Banking functions, such as deposits and withdrawals d) Sales forecasting applications Which layer of the data warehouse architecture does an end user directly deals with a) Staging layer b) External data layer c) Information access layer d) Data warehouse layer What should be the primary source of data for a data mart? a) A subset of the data warehouse created in the database management system b) Data extracted from the target systems c) The operational databases d) Data extracted from the data warehouse databases Where is Meta data usually store? a) Word Processing documents b) Spread sheets c) Information Repository d) All the above Which of the following are the examples of dimensions? a) Customers b) Time c) Product d) All the above The process of populating a data warehouse is called a) Loading b) Extracting c) Transforming d) None of the Above 12. What is the advantage in going for a Bottom Up approach in building a Enterprise Data Warehouse a) Data Cleaning is not required b) Ensures that all departments get a Data Mart c) Return on Investment can be felt at the earliest d) Data Loading is Faster 13. Which of the following must be a function of a data Extraction and transformation tool? a) Ability to retrieve data from all known database management systems b) Ability to store the data mart database designs and make those designs available to the business analysts c) Ability to translate data elements in the source systems into data warehouse data d) Ability to run on all known platforms and operating systems 14. The process of requesting detailed information a) Drill up b) Drill left c) Drill down d) Drill right 15. What is the difference between a Data Warehouse or Data Mart and an Operational Data Store? a) An operational data store contains more current data than either a Data Warehouse or a Data Mart b) An operational data store and a data warehouse or data mart track different subject areas in the organization c) An operational data store is a copy of the data warehouse or data mart d) The operational data store tends to be larger than the data warehouse or data mart 16. Data Warehousing Characteristics (i) Subject Oriented (ii) Non Volatile (iii) Time Variant (iv) Integrated (v) Time Invariant a) (iii) &(i)&(iv)&(v) Only b) (i)&(ii)&(iv)&(v) Only c) (i)&(ii)&(iii)&(iv) Only d) All 17. Which process loads the data from heterogeneous source systems to the data warehouse a) Cleaning b) Mining c) ETL Process d) None of the Above 18. What does the term "Ad-hoc Analysis" mean? a) Business analysts access the Data Warehouse data from different locations. b) Business analysts use sampling techniques c) Business analysts start query and analysis on the fly d) Business analysts access the Data Warehouse data infrequently 19. Sparse Data in OLAP Cube indicates a) Missing Data b) Data Repetitions c) Zeroes d) Rare Data 20. What is an Operational System? a) An application system that supports the organization's day to day activities b) An application system that tracks and manages the financial assets of the organization c) An application system that supports the creation of products(s) that the organization markets d) An application system that supports the planning and forecasting within the organization
2.
3.
4.
5.
6.
7.
8.
9.
10. Architected Data Warehouse a) Reduces server and client processing required b) Does not maintain historical integrity c) Inefficient data acquisition d) None of the Above 11. Which one of the following are called 'Stove Pipe Systems'? a) DataWarehouse b) Dependent Data Marts c) Independent Data Marts d) Operational Data Stores
2
21. Which of the following are the advantages of creating individual marts and then rolling them up into a central warehouse? 1)Quick Successes 2)Rapid prototyping of data transformations 3)Duplication of data a) 1 only b) 1 and 2 only c) 1, 2 and 3 d) None of the Above 22. Disadvantage of data mart is a) Does not provide integrated view of information b) Uncontrolled proliferation results in redundancy c) More number of Data Marts are complex to maintain d) All of the above 23. Which of the following statements correctly describe a dimension table in Dimensional Modeling? a) Dimension tables do not contain numeric fields b) Dimension tables do not need system-generated keys c) Dimension tables usually have fewer fields than fact tables d) Dimension tables contain fields that describe the facts 24. What is the most common use of a multi dimensional database (MDDB)? a) Access pre determined aggregated data across several dimensions b) Access huge data warehouses c) Access application packages d) As the only type of database management system used for a data warehouse 25. Table Denormalization (a)Improves Query performance (b) Duplicates Information a) Only (a) True b) Only (b) True c) Both (a) and (b) True d) None 26. Bit Mapped Indexing can be used a) When the distinct value of a column is high b) When the distinct value of a column is low c) When the storage space available is less d) When the table is having very few rows 27. Which of the following is NOT a type of process typically done by a OLAP tool? a) Creation of large, detailed transaction level reports b) "WHAT IF" analysis c) "Slicing and Dicing" of the data with drill down when something interesting is found. d) Time series analysis 28. Scheduling of various tasks needed to build and maintain a data warehouse is taken care of by a) Staging layer b) Process management layer c) Information access layer d) Transformation layer 29. What is an Operational Data Store (ODS)? a) A set of database that support reporting from an application system b) A set of databases that provide integrated operations data to serve the organization's day to day activities c) A set of database to provide operational data for a single department d) A set of databases that support OLAP 30. OLAP that accesses the raw data lying in the RDBMS for reporting is called as a) Multidimensional OLAP b) Relational OLAP c) Hybrid OLAP d) All the above 31. Data Warehouses and Data Marts assist a) Reports to regulatory agencies b) Audit reporting c) Decision Support d) Accounting Reporting 32. Operational Data Store (ODS) Characteristics (a) Subject Oriented (b) Volatile (c) Current or Near Current collection of data (d) integrated a) (b)&(a) b) (a)&(b)&(c)&(d) c) (d)&(c) d) (a)&(b)&(c) 33. What is a Fact Table in a Data Warehouse terminology? a) Any Table that has the history of a Business. b) Table that contains Time related data c) Any Table present in a Data Warehouse Table d) Table that contains measurable data. 34. What is the difference between a Data Warehouse or Data Mart and an Operational Data Store? a) An operational data store contains more current data than either a Data Warehouse or a Data Mart b) An operational data store and a data warehouse or data mart track different subject areas in the organization c) An operational data store is a copy of the data warehouse or data mart d) The operational data store tends to be larger than the data warehouse or data mart 35. Why data in a Data Warehouse called as Time Variant? a) Because data in the data warehouse is accurate as of some moment in time b) Because every key structure in the data warehouse contains - implicitly or explicitly -an element of time, such as day, week, month, etc. c) (a)&(b) d) Only (a) 36. Data Warehouse is a) Collection of History Data b) Query Centric c) Decision Support System d) All the above 37. Enterprise Data warehouse contains a) Only detailed data b) Only summarized data c) Detailed and summarized data d) None of the Above 38. What is Snow Flake Schema in a Database design? a) The Dimension Tables have a Foreign Key Table. b) The Dimension Tables do not have a Foreign Key Table. c) The Fact Table has one Dimension Table d) The Dimension table is Only of Time 39. In order to be successful at decision-making, what does an organization need? a) Experienced decision makers b) Adequate and timely data c) A corporate strategy d) All the above 40. Non Architectured Data Marts are also known as a) Legacy Data Marts b) Lega Marts c) Non-Integrated Data Marts d) All of the above
3
41. In building a data warehouse 60 - 80 % of work is required in which stage? a) Database Design b) ETL deployment c) OLAP deployment d) Cleaning 42. Categories of OLAP Tools: Level 1- Basic query and display of data; Level 2- Level 1 + advanced selection and arithmetic operations; Level 3- Level 1 and Level 2 + sophisticated data analysis techniques. Which of the following is an example of a process a) Display a report based on specific selection criteria b) Drill down to another level of detail c) Calculate a rolling average on a set of data d) Display the top 10 items that meet a specific selection criteria 43. The Fact Table is related to a Dimension Table in the Dimensional Modeling by a) One to Many b) Many to One c) One to One d) Many To Many 44. Which of the following are considered to be advantages while building a data warehouse? a) The ability to access Enterprise-Wide data b) The ability to have consistent data c) The ability to perform analysis quickly d) All the above 45. Ad-hoc access path, low transaction volume, low number of users are the characteristics of which system a) Informational system b) Decision support system c) Operational system d) Transactional system 46. The key performance indicators of an enterprise are a) Dimension attributes b) Facts, Measures c) Summary data d) Detailed data 47. What is an Active Data Warehouse? a) A Production Data Warehouse b) Close-coupled OLTP and Data Warehouse c) Close-coupled Data Marts d) None 48. Executing a decision support query against an operational system usually results in a) No change in performance b) Degradation in performance c) Improvement in performance d) None 49. Data can be cleaned (a)at the source(b)during transformation(c)in the Data Warehouse a) Only (a) b) Only (b) c) Could be by all methods or any one of the methods based on the Environment of Data Warehouse Setup d) Only (c) 50. What modeling technique would be used to design a specific database that will be implemented with star schema? a) Object Modeling b) Entity Relationship Modeling c) Dimensional Modeling d) None of the Above 51. The person who defined rules for OLAP(OnLine Analytical Processing) is a) Bill Inmon b) Ralph Kimball c) C J Date d) E F Codd 52. What is a Dimension Table in Data Warehouse terminology? a) Any Table present in a Data Warehouse Table b) Table of members, positions, or units of the same type which is used as categories by which data is analyzed c) Any Table storing measurable unit d) Table storing Data type that has a value to be analyzed 53. In which schema only primary dimension tables are joined to fact tables? a. Star Schema b. Snow flaked schema c. Both star schema and snow flaked schema d. None of the Above 54. What is the substantial benefit of implementing a Data Warehouse or Data Mart? a) Improves the morale of the organization's knowledge workers b) Provides many new tools to the executives within the organization c) Increases technical knowledge within the executive ranks of the organization because they will have to learn to use a computer d) Improves decision-making within the organization 55. OLAP that accesses its own proprietary Data Storage for reporting is called as a) Multidimensional OLAP b) Relational OLAP c) Hybrid OLAP d) All the above 56. Characteristics of dimension tables are a) Contains a primary key b) Has one to many relationship to the fact table c) Contains other attribute columns that are useful for levels of aggregation d) All the above 57. The type of Modeling used for the Database designing in Star Schema is a) ER Modeling b) Dimensional Modeling c) Any Method d) All the methods 58. The following statement fits for Data Mining (a) Its an ad hoc reporting tool (b) Its a technology to find Hidden patterns in huge data a) Only (a) b) Only (b) c) Both (a) and (b) d) None 59. Degenerate dimensions can be represented as a) An entry in the fact table without an associated dimension table b) Dimension table that contain all attributes that are necessary to provide query values c) An entry in the fact table with an associated dimension table d) None of the Above
4
60. REDBRICK and TERADATA databases are mostly used for a) OLTP applications b) OLAP applications c) Both OLTP and OLAP applications d) OO Database applications 61. Which one of the following is NOT an example of measures? a) Order Quantity b) Product name c) Dollar Value d) Inventory Count 62. Which of the following is a problem/analysis/study that could be assisted by using Clustering Data Mining Technique a) Market Basket Analysis b) Credit Card Fraud Detection c) Campaign Marketing d) Time Series Analysis 63. Which one of the following is the complete general process in the order involved in building a Data Warehouse? a) Evaluate Data ->Extract Data ->Store Data b) Extract Data ->Evaluate Data c) Extract Data->Store Data ->Evaluate Data d) None of the Above 64. Which one of the following is NOT an advantage of breaking a Data Warehouse up into smaller Data Marts a) Data Marts are guaranteed to have shared fields b) All data transformations from sources are common c) It takes longer to develop the Data Warehouse due to increased time for advanced planning and design d) None of the Above 65. Which of the following statements are true about OLAP? a) Ad Hoc Reporting Tool b) Has Drill through, Slice & Dice facility c) Built on a Data Warehouse/Data Mart d) All the above 66. Casual dimensions can be used a) As a helper table b) For explaining why a record exists in a fact table c) For integrating data marts into a data warehouse d) For handling changes to the data. 66. Factless Fact tables are used a) For event tracking b) For handling multi valued dimensions c) As a helper table d) None of the above 67. Helper tables are used a) For handling one to many relationships b) For handling many to one relationships c) For handling many to many relationships d) None of the above 68. Which of the following is not true with for an ODS? a) It is integrated and subject oriented b) Contains current and detailed data c) Contains years of historical data d) Used for making tactical decisions 69. Conformed dimensions are used a) For handling multi valued dimensions b) For integrating data marts into a data warehouse c) For event tracking d) For explaining why a record exists in a fact table 70. Which of the following statements are not true? a) A collection of data marts does not equal a data warehouse. b) A data warehouse contains summarized data. c) A data warehouse contains subject oriented, volatile, integrated and cleansed data. d) None of the above 71. Metadata is associated with the following a) Extraction Transformation and Loading b) Front end access c) Data warehouse administration d) All of the above 72. Compound Growth Factor (CGF) is associated with a) Data explosion b) Data integration c) Scalability d) Data warehouse performance 73. The relationship between a fact table and a dimension table is a) One to One b) One to Many c) Many to One d) Many to Many 74. Which of the following is suited for large, transaction intensive applications? a) ROLAP b) MOLAP c) HOLAP d) DOLAP 75. An Architected Data Warehouse a) Does not maintain historical integrity b) Is inefficient with respect to data acquisition c) Reduces server and client processing required d) Does not store historical data 76. A fact table is linked to a dimension table using a) Surrogate keys b) Compound keys c) Complex keys d) None of the above 77. The following methods can be used for handling slowly changing dimensions a) Inserting a new field in the table b) Inserting a new record in the table c) Overwrite the existing data d) All of the above 78. In a Bottom Up data warehouse architecture a) The data warehouse is created first and then the data marts b) The data marts are created first and then the data warehouse c) A set of independent data marts are created d) An enterprise data warehouse is created without creating data marts 79. In a Top Down data warehouse architecture a) The data warehouse is created first and then the data marts b) The data marts are created first and then the data warehouse c) A set of independent data marts are created d) An enterprise data warehouse is created without creating data marts 80. Which of the statements are not true? a) A data mart contains less information than a data warehouse b) A data mart covers a single subject area c) Data marts can be used for creating a data warehouse d) None of the above. 81. Which of the statements are not true for MOLAP Architecture? a) Processing overhead for large input data sets is high b) The no of dimensions is usually restricted to 10 or less c) Scalability is good d) It has a good user interface and functionality 82. Which of the following is associated with coverage tables? a) Factless fact tables b) Conformed dimensions c) Casual dimensions d) Helper tables
5
83. Which of the following statements are false for surrogate keys? a) Surrogate keys should not be composed of natural keys glued together. b) Surrogate keys should ideally be numeric integers. c) Surrogate keys should contain meaningful information. d) None of the above 84. How are dimensions in a Multi Dimensional database related? a) Hierarchically b) In a Network c) By an inverse list d) Non of the above 85. The process of standardizing the data that is extracted from the operational systems is known as a) Transformation b) Cleansing c) Population d) Data Staging 86. Which of the following is used for calculating the Compound Growth Factor for an application? a) Sparsity of the data b) Clustering of the data c) Number of dimensions and the number of levels in the dimension hierarchy d) All of the above 87. Which of the following is an MOLAP tool ? a) Holos (Segate Software) b) Business Objects c) Brio Query Enterprise d) DSS Server (MicroStrategy) 88. The term Corporate Information Factory is associated with a) Bill Inmon b) Ralph Kimball c) Sid Adelman d) Chuck Kelley 89. Which of the following is suited for Near Line Storage? a) Photo optical storage b) Siloed tape storage c) A and B d) None of the above 90. Which of the following statements are false for ROLAP? a) Query performance is slower compared to MOLAP b) Suitable for frequent updates c) Scalability is poor d) Summary tables are implemented in the relational database 91. Which type of modeling technique is associated with a data warehousing? a) Dimensional Modeling b) ER Modeling c) Object Oriented Modeling d) All of the above 92. OLAP that access its own proprietary data storage for reporting is called as a) Multidimensional OLAP b) Relational OLAP c) Hybrid OLAP d) All of the above 93. Which of the following is not an ETL tool? a) Power Mart b) Data Stage c) Scenario d) Data Junction 94. Majority of work involved in a data warehousing project is carried out in this phase a) Requirements gathering b) Data warehouse design c) d) ETL Data warehouse testing
95. What two types of processing are done in an exploration warehouse? a) Exploration and Data Mining b) OLAP and Data Mining c) Exploration and OLAP d) Data Mining and Multidimensional analysis 96. Data that is loaded into a data warehouse and never used is referred to as a) Dormant data b) Static data c) Operational data d) Time dependant data 97. What determines the class of an ODS? a) The type of data that is stored in the ODS b) The speed of update synchronization from the operational environment to the ODS c) The duration of data that is stored in the ODS. d) The amount of time it takes to update the ODS 98. What is meant by a wrinkle of time? a) The time it takes to populate a data warehouse b) The amount of time that elapses between the update of a record in the operational environment and the time that the update is reflected in the data warehouse c) The granularity of the time dimension that is present in the data warehouse d) The time it takes to populate the data warehouse from the data staging area 99. What is the metadata called that is used for the corporate information factory? a) Exploration metadata b) Corporate metadata c) Distributed metadata d) Integrated metadata 100. When comparing ROLAP with MOLAT which of the following statements are true? a) User interface and functionality is good in ROLAP and normal in MOLAP. b) MOLAP and ROLAP stores details and summarized data. c) The common access language for ROLAP and MOLAP is SQL. d) ROLAP support for a large number of users is good where as there is limited support in the case of MOLAP.