Which of the following refers to data in the form of rows and columns?
Which of the following statements is true of BigData?
BigData refers to data sets that are at least a petabyte in size.
Which of the following statements is true of business intelligence publishing alternatives?
It is more difficult publish dynamic BI than to publish static content.
The use of business intelligence for identifying changes in the purchasing patterns of customers is a labor-intensive process. T/F
A ______ is a facility for managing an organization’s business intelligence data.
The curse of dimensionality states that the more attributes there are, the more difficult it is to build a model that fits the sample data. T/F
______ requires users to request business intelligence results.
The use of an organization’s operational data as the source data for a business intelligence system is not usually recommended because it _______.
requires considerable processing and can drastically reduce system performance
The attributes there are in a sample data, the easier it is to build a model that fits the sample data, but that is worthless as a predictor. Which of the following best explains this phenomenon?
the curse of dimensionality
Project management is one of the few domains in which business intelligence is rarely used. T/F
______ refers to the level of detail represented by data.
_____ are reports produced when something out of predefined bounds occurs.
MapReduce is a technique for harnessing the power of thousands of computers working in parallel. T/F
The goal of _______, a type of business intelligence analysis, is to create information about past performance.
Users in a data mart obtain data that pertain to a particular business function from a data warehouse. T/F
BI analysis is the process of obtaining, cleaning, organizing, relating, and cataloging source data. T/F
The purchasing pattern of an individual never change. T/F
________ is the process of sorting, grouping summing, filtering, and formatting structured data.
Static reports are business intelligence documents that are updated at the time they are requested. T/F
Regression analysis is used to identify groups of entities that have similar characterisitcs. T/F
BigData refers to data that have great variety and may have structured data as well as different formats. T/F
An advantage of data warehouses is the low cost required to create, staff, and operate them. T/F
Which is the following statement is true of a data warehouse?
A data warehouse is larger than a data mart.
Which of the following statements is true of data with granularity?
It can be too fine or too coarse and also have wrong granularity.
The three fundamental categories of BI analysis are reporting, data mining, and BigData. T/F
Which of the following activities in the business intelligence process involves delivering business intelligence to the knowledge workers who need it?
_______ is used to measure the impact of a set of variables on another variable during data mining.
Reporting analysis is used primarily for classifying and predicting BI data. T/F
BigData has volume, velocity, and variation characteristics that far exceed those of traditional reporting and data mining. T/F
Which of the following statements is true of Hadoop?
Hadoop is an open source program that implements MapReduce
The skills required to create a publishing application for dynamic content are low. T/F
As information systems, BI systems have three standard components. T/F
_______ techniques emerged from the combined discipline of statistics, mathematics, artificial intelligence, and machine-learning.
If the granularity of certain data is too coarse, the data can be separated into constituent parts using statistical techniques. T/F
______ is an open source program supported by Apache Foundation that manages thousands of computers and that implements MapReduce.
Data marts are usually larger than data warehouses. T/F
Which of the following problems is particularly common for data that have been gathered over time?
lack of consistency
Problematic data are termed _______.
A _______ is a data collection, smaller than the data warehouse that addresses the needs of a particular department or functional area of a business.
The management function of BI servers maintains metadata about the authorized allocation of BI results to users. T/F
Which of the following statements is true of unsupervised data mining?
Analysts create hypotheses only after performing an analysis.
A printed sales analysis is an example of a dynamic report. T/F
_______ is the process of delivering business intelligence to users without any request from the users.
In the ____ phase, a BigData collection is broken into pieces and hundreds or thousands of independent processors search these pieces for something of interest.
In the case of ______, data miners develop models prior to conducting analyses and then apply statistical techniques to data to estimate parameters of the models.
supervised data mining
Push options are manual when emails or collaboration tools are used for BI publishing. T/F
______ is the application of statistical techniques to find patterns and relationships among data for classification and prediction.
______ is the process of obtaining, cleaning, organizing, relating, and cataloging source data.
The data that an organization purchases from data vendors can act as the source data for a business intelligence system. T/F
The results generated in the map phase are combined in the _______ phase.
Which of the following is a fundamental category of business intelligence analysis?
With unsupervised data mining, analysts do not create a model or hypothesis before running the analysis. T/F
Problem solving requires project management. T/F
Placing business intelligence applications on operational servers can dramatically reduce system performance. T/F
Users in a data mart obtain data that pertain to a particular business function from a ________.
The patterns, relationships, and trends identified by BI systems are called business intelligence. T/F
_______ are user requests for particular business intelligence results on a particular schedule or in response to particular events.
Data analysts who work with data warehouses are experts at data management, data cleaning, data transformation, and data relationships. T/F
______ is an unsupervised data mining technique in which statistical identify groups of entities that have similar characteristics.
Plush publishing requires a user to request BI results. T/F
External data purchased from outside resources are not includes in data warehouses. T/F
Structured data is data in the form of rows and columns. T/F
A data warehouse is a facility for managing an organization’s business intelligence data. T/F
Which of the following statements is true of business intelligence systems?
Business intelligence systems analyze an organization’s past performance to make predictions.
The source, format, assumptions and constraints, and other facts concerning certain data are called ______.
_____ are business intelligence documents that are fixed at the time of creation and do not change.
Regression analysis is used in _____.
supervised data mining
_______ are business intelligence documents that are updated at the time they are requested.
_______process operational and other data in organizations to analyze past performance and make predictions.
Business intelligence systems
The granularity in clickstream data is too coarse. T/F
The ______ of business intelligence servers maintains metadata about the authorized allocation of business intelligence results to users.
Data inconsistencies can occur from the nature of a business activity. T/F
Data granularity refers to the amount of data represented by data. T/F
Data marts are data collections that address the needs of a particular department or functional area of a business. T/F
BigData has low velocity and is generated slowly. T/F
A _______ is designed to extract data from operational systems and other sources, clean the data, and store and catalog that data for processing by business intelligence tools.
Cluster analysis measures the impact of a set of variables on another variable. T/F