MIS Final Review Chapter 9

Which of the following refers to data in the form of rows and columns?
structured data
Which of the following statements is true of BigData?
BigData refers to data sets that are at least a petabyte in size.
Which of the following statements is true of business intelligence publishing alternatives?
It is more difficult to publish dynamic BI than to publish static content.
The use of business intelligence for identifying changes in the purchasing patterns of customers is a labor-intensive process. T/F
False
A ______ is a facility for managing an organization’s business intelligence data.
data warehouse
The curse of dimensionality states that the more attributes there are, the more difficult it is to build a model that fits the sample data. T/F
False
______ requires users to request business intelligence results.
Pull publishing
The use of an organization’s operational data as the source data for a business intelligence system is not usually recommended because it _______.
requires considerable processing and can drastically reduce system performance
The more attributes there are in sample data, the easier it is to build a model that fits the sample data but that is worthless as a predictor. Which of the following best explains this phenomenon?
the curse of dimensionality
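A minimal sketch of this effect, under stated assumptions: the customers, their yes/no attributes, and the lookup-table "model" (memorizer_fit) are all hypothetical, and the labels are coin flips, so no attribute truly predicts them. The fit on the sample still climbs to 100% as more attributes are used.

```python
import random

def memorizer_fit(rows, labels, k):
    """Fraction of sample rows a lookup-table 'model' gets right using the first k attributes."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[:k], []).append(label)
    # The model predicts the majority label within each attribute combination.
    correct = sum(max(g.count(0), g.count(1)) for g in groups.values())
    return correct / len(rows)

rng = random.Random(0)  # fixed seed so the run is repeatable
N_ATTRS = 12
# 40 distinct hypothetical customers, each described by 12 yes/no attributes
# (the bits of a distinct integer, so no two rows are identical).
ids = rng.sample(range(2 ** N_ATTRS), 40)
rows = [tuple((i >> b) & 1 for b in range(N_ATTRS)) for i in ids]
# Labels are coin flips: by construction, no attribute predicts them.
labels = [rng.randint(0, 1) for _ in rows]

for k in (0, 2, 4, 8, 12):
    print(k, "attributes -> fit on sample:", round(memorizer_fit(rows, labels, k), 2))
```

With all 12 attributes every row is unique, so the table reproduces the sample perfectly, yet it has learned nothing that would carry over to a new customer.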
Project management is one of the few domains in which business intelligence is rarely used. T/F
False
______ refers to the level of detail represented by data.
Granularity
_____ are reports produced when something out of predefined bounds occurs.
Exception reports
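As a rough illustration, an exception report can be thought of as a filter that keeps only the values outside predefined bounds. The store names, sales figures, and bounds below are made up for the example.

```python
def exception_report(readings, lower, upper):
    """Return only the readings that fall outside the predefined bounds."""
    return [(name, value) for name, value in readings if value < lower or value > upper]

# Hypothetical daily sales totals by store.
daily_sales = [("Store A", 1200), ("Store B", 310), ("Store C", 980), ("Store D", 2650)]

# Only stores below 500 or above 2000 appear in the report.
print(exception_report(daily_sales, lower=500, upper=2000))
# -> [('Store B', 310), ('Store D', 2650)]
```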
MapReduce is a technique for harnessing the power of thousands of computers working in parallel. T/F
True
The goal of _______, a type of business intelligence analysis, is to create information about past performance.
reporting analyses
Users in a data mart obtain data that pertain to a particular business function from a data warehouse. T/F
True
BI analysis is the process of obtaining, cleaning, organizing, relating, and cataloging source data. T/F
False
The purchasing patterns of an individual never change. T/F
False
________ is the process of sorting, grouping, summing, filtering, and formatting structured data.
reporting analysis
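The five operations named in the question can be sketched on a small table. The sales rows and the function name sales_by_region are hypothetical; real reporting tools do the same steps at much larger scale.

```python
from collections import defaultdict

# Hypothetical structured data: one row per sale.
sales = [
    {"region": "East", "product": "Widget", "amount": 120.0},
    {"region": "West", "product": "Widget", "amount": 75.5},
    {"region": "East", "product": "Gadget", "amount": 40.0},
    {"region": "West", "product": "Gadget", "amount": 15.0},
    {"region": "East", "product": "Widget", "amount": 60.0},
]

def sales_by_region(rows, min_amount=0.0):
    """Filter rows, group them by region, sum each group, and sort by total."""
    filtered = [r for r in rows if r["amount"] >= min_amount]  # filtering
    totals = defaultdict(float)
    for r in filtered:                                         # grouping and summing
        totals[r["region"]] += r["amount"]
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)  # sorting

for region, total in sales_by_region(sales, min_amount=20.0):
    print(f"{region:<6}${total:>8.2f}")                        # formatting
```

The result describes past performance only; nothing here classifies or predicts.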
Static reports are business intelligence documents that are updated at the time they are requested. T/F
False
Regression analysis is used to identify groups of entities that have similar characteristics. T/F
False
BigData refers to data that have great variety and may have structured data as well as different formats. T/F
True
An advantage of data warehouses is the low cost required to create, staff, and operate them. T/F
False
Which of the following statements is true of a data warehouse?
A data warehouse is larger than a data mart.
Which of the following statements is true of data with granularity?
It can be too fine or too coarse and also have wrong granularity.
The three fundamental categories of BI analysis are reporting, data mining, and BigData. T/F
True
Which of the following activities in the business intelligence process involves delivering business intelligence to the knowledge workers who need it?
publish results
_______ is used to measure the impact of a set of variables on another variable during data mining.
Regression analysis
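A minimal sketch of the idea, simplified to a single input variable (the question speaks of a set of variables). The advertising and sales numbers are invented, and fit_line is a hypothetical helper that applies ordinary least squares.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: advertising spend (in $1,000s) and units sold.
ad_spend = [1, 2, 3, 4, 5]
units_sold = [12, 14, 16, 18, 20]

slope, intercept = fit_line(ad_spend, units_sold)
print(f"units_sold = {intercept:.1f} + {slope:.1f} * ad_spend")
# -> units_sold = 10.0 + 2.0 * ad_spend
```

The slope is the measured impact: in this made-up data, each extra $1,000 of advertising goes with 2 more units sold.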
Reporting analysis is used primarily for classifying and predicting BI data. T/F
False
BigData has volume, velocity, and variation characteristics that far exceed those of traditional reporting and data mining. T/F
True
Which of the following statements is true of Hadoop?
Hadoop is an open source program that implements MapReduce.
The skills required to create a publishing application for dynamic content are low. T/F
False
As information systems, BI systems have three standard components. T/F
False
_______ techniques emerged from the combined disciplines of statistics, mathematics, artificial intelligence, and machine learning.
Data mining
If the granularity of certain data is too coarse, the data can be separated into constituent parts using statistical techniques. T/F
False
______ is an open source program supported by the Apache Foundation that manages thousands of computers and that implements MapReduce.
Hadoop
Data marts are usually larger than data warehouses. T/F
False
Which of the following problems is particularly common for data that have been gathered over time?
lack of consistency
Problematic data are termed _______.
dirty data
A _______ is a data collection, smaller than the data warehouse, that addresses the needs of a particular department or functional area of a business.
data mart
The management function of BI servers maintains metadata about the authorized allocation of BI results to users. T/F
True
Which of the following statements is true of unsupervised data mining?
Analysts create hypotheses only after performing an analysis.
A printed sales analysis is an example of a dynamic report. T/F
False
_______ is the process of delivering business intelligence to users without any request from the users.
Push publishing
In the ____ phase, a BigData collection is broken into pieces and hundreds or thousands of independent processors search these pieces for something of interest.
map
In the case of ______, data miners develop models prior to conducting analyses and then apply statistical techniques to data to estimate parameters of the models.
supervised data mining
Push options are manual when emails or collaboration tools are used for BI publishing. T/F
True
______ is the application of statistical techniques to find patterns and relationships among data for classification and prediction.
Data mining
______ is the process of obtaining, cleaning, organizing, relating, and cataloging source data.
Data acquisition
The data that an organization purchases from data vendors can act as the source data for a business intelligence system. T/F
True
The results generated in the map phase are combined in the _______ phase.
reduce
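The two phases can be sketched on a single machine. This is only an illustration of the idea, not how Hadoop is programmed: the search log, the term, and the function names map_phase and reduce_phase are assumptions made for the example.

```python
from collections import Counter
from functools import reduce

def map_phase(piece, term):
    """Search one piece of the collection and count the lines that mention the term."""
    return Counter({term: sum(1 for line in piece if term in line.lower())})

def reduce_phase(partial_counts):
    """Combine the per-piece results into a single result."""
    return reduce(lambda a, b: a + b, partial_counts, Counter())

# Hypothetical search log, already broken into pieces.
pieces = [
    ["cheap flights", "hotel deals", "flights to rome"],
    ["weather today", "flights status"],
    ["car rental", "train tickets"],
]

# On a real cluster each piece would go to a different processor;
# here the pieces are simply processed one after another.
partials = [map_phase(piece, "flights") for piece in pieces]
total = reduce_phase(partials)
print(total["flights"])  # -> 3
```

Because each piece is searched independently, the map step can be spread over as many processors as there are pieces.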
Which of the following is a fundamental category of business intelligence analysis?
reporting
With unsupervised data mining, analysts do not create a model or hypothesis before running the analysis. T/F
True
Problem solving requires project management. T/F
False
Placing business intelligence applications on operational servers can dramatically reduce system performance. T/F
True
Users in a data mart obtain data that pertain to a particular business function from a ________.
data warehouse
The patterns, relationships, and trends identified by BI systems are called business intelligence. T/F
True
_______ are user requests for particular business intelligence results on a particular schedule or in response to particular events.
Subscriptions
Data analysts who work with data warehouses are experts at data management, data cleaning, data transformation, and data relationships. T/F
True
______ is an unsupervised data mining technique in which statistical techniques identify groups of entities that have similar characteristics.
cluster analysis
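One common way to find such groups is k-means, used here as an illustrative choice; the review itself does not name a specific algorithm. The customer spending figures and starting centers are hypothetical.

```python
def k_means_1d(values, centers, rounds=10):
    """Assign each number to its nearest center, then move each center to its group's mean."""
    clusters = []
    for _ in range(rounds):
        clusters = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Keep the old center if a group ends up empty.
        centers = [sum(c) / len(c) if c else centers[i] for i, c in enumerate(clusters)]
    return clusters, centers

# Hypothetical annual spend per customer, in dollars.
spend = [20, 25, 30, 480, 500, 520]

clusters, centers = k_means_1d(spend, centers=[0.0, 100.0])
print(clusters)  # -> [[20, 25, 30], [480, 500, 520]]
print(centers)   # -> [25.0, 500.0]
```

No hypothesis about "low spenders" or "high spenders" is supplied in advance; the groups come out of the data, which is what makes the technique unsupervised.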
Push publishing requires a user to request BI results. T/F
False
External data purchased from outside sources are not included in data warehouses. T/F
False
Structured data is data in the form of rows and columns. T/F
True
A data warehouse is a facility for managing an organization’s business intelligence data. T/F
True
Which of the following statements is true of business intelligence systems?
Business intelligence systems analyze an organization’s past performance to make predictions.
The source, format, assumptions and constraints, and other facts concerning certain data are called ______.
metadata
_____ are business intelligence documents that are fixed at the time of creation and do not change.
Static reports
Regression analysis is used in _____.
supervised data mining
_______ are business intelligence documents that are updated at the time they are requested.
Dynamic reports
_______ process operational and other data in organizations to analyze past performance and make predictions.
Business intelligence systems
The granularity in clickstream data is too coarse. T/F
False
The ______ of business intelligence servers maintains metadata about the authorized allocation of business intelligence results to users.
management function
Data inconsistencies can occur from the nature of a business activity. T/F
True
Data granularity refers to the amount of data represented by data. T/F
False
Data marts are data collections that address the needs of a particular department or functional area of a business. T/F
True
BigData has low velocity and is generated slowly. T/F
False
A _______ is designed to extract data from operational systems and other sources, clean the data, and store and catalog that data for processing by business intelligence tools.
data warehouse
Cluster analysis measures the impact of a set of variables on another variable. T/F
False