320+ Data Mining Solved MCQs

201.

Data mining methodology states that, in the optimal situation, data mining is an _____.

A. standard process.
B. complete process.
C. creative process.
D. ongoing process.
Answer» D. ongoing process.
202.

The ___ algorithm can be applied in cleaning data.

A. search.
B. pattern recognition.
C. learning.
D. clustering.
Answer» B. pattern recognition.
203.

________ is the type of pollution that is difficult to trace.

A. Duplication of records.
B. Ambiguity.
C. Lack of domain consistency.
D. Lack of information.
Answer» C. Lack of domain consistency.
204.

In the coding stage of the KDD process, one should be cautious about ______.

A. adding the data.
B. deleting the data.
C. coding.
D. discovering patterns.
Answer» B. deleting the data.
205.

The statement that is not true about data mining is ______.

A. data mining is not a single technique.
B. it finds the hidden patterns from data set.
C. it is a real discovery process.
D. all forms of pollutions are found during the data mining stage itself.
Answer» D. all forms of pollutions are found during the data mining stage itself.
206.

The first step in data mining project is ________.

A. rough analysis of data set using traditional query tools.
B. cleaning the data.
C. recognizing the patterns.
D. visualizing the patterns.
Answer» A. rough analysis of data set using traditional query tools.
207.

SQL can find ________ type of data.

A. narrow data.
B. multidimensional data.
C. shallow data.
D. hidden data.
Answer» C. shallow data.
208.

_______ is used to find relationship between multidimensional data.

A. K-nearest neighbor.
B. Decision trees.
C. Association rules.
D. OLAP tools.
Answer» D. OLAP tools.
209.

Which one of the following is not true about OLAP?

A. They create no new knowledge.
B. OLAP is more powerful than data mining tools.
C. They cannot search for new solutions.
D. OLAP tools store their data in a special multidimensional format.
Answer» B. OLAP is more powerful than data mining tools.
210.

Association rules are always defined on________.

A. binary attribute.
B. single attribute.
C. relational database.
D. multidimensional attribute.
Answer» A. binary attribute.
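As a rough illustration of why association rules are framed over binary attributes, the sketch below treats each item as present or absent in a transaction and computes support and confidence for one rule. The basket data and the function names are made up for the example.

```python
# Each transaction is a set of items; equivalently, every item is a binary
# attribute that is either present (1) or absent (0) in a transaction.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

def support(itemset):
    """Fraction of transactions that contain every item in itemset."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)

def confidence(antecedent, consequent):
    """Estimated P(consequent | antecedent) for the rule antecedent -> consequent."""
    return support(antecedent | consequent) / support(antecedent)

# Rule under test: bread -> milk
rule_support = support({"bread", "milk"})
rule_confidence = confidence({"bread"}, {"milk"})
```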
211.

Neural networks are modeled on the ___________.

A. neuron.
B. network.
C. human brain.
D. machines.
Answer» C. human brain.
212.

______ built the perceptron.

A. Kohonen.
B. Frank Rosenblatt.
C. Minsky.
D. Papert.
Answer» B. Frank Rosenblatt.
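For readers who want to see the idea in code, here is a minimal sketch of the classic perceptron learning rule with a step activation. It is not Rosenblatt's original hardware design; the function names, the learning rate, and the logical-AND training set are assumptions chosen for the example.

```python
def predict(weights, bias, x1, x2):
    """Step activation: fire (1) only when the weighted sum is positive."""
    return 1 if weights[0] * x1 + weights[1] * x2 + bias > 0 else 0

def train_perceptron(samples, epochs=20, lr=1.0):
    """Perceptron rule: nudge weights by lr * error * input after each sample."""
    weights = [0.0, 0.0]
    bias = 0.0
    for _ in range(epochs):
        for (x1, x2), target in samples:
            error = target - predict(weights, bias, x1, x2)
            weights[0] += lr * error * x1
            weights[1] += lr * error * x2
            bias += lr * error
    return weights, bias

# Logical AND is linearly separable, so a single perceptron can learn it.
and_samples = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias = train_perceptron(and_samples)
predictions = [predict(weights, bias, x1, x2) for (x1, x2), _ in and_samples]
```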
213.

The input units of a perceptron are called _______.

A. associators.
B. responders.
C. neuron.
D. photo receptors.
Answer» D. photo receptors.
214.

The intermediate layers in a back-propagation network consist of __________.

A. photo receptors.
B. responders.
C. hidden nodes.
D. associators.
Answer» C. hidden nodes.
215.

The area for exchange of views between biology and computer science is called __________.

A. evolutionary programming.
B. evolutionary computing.
C. evolutionary strategies.
D. genetic algorithm.
Answer» B. evolutionary computing.
216.

Genetic algorithm is viewed as a kind of______.

A. meta learning strategy.
B. machine learning.
C. evolution.
D. OLAP tool.
Answer» A. meta learning strategy.
217.

_________ is knowledge that can be found by using pattern recognition algorithms.

A. hidden knowledge.
B. deep.
C. shallow.
D. multidimensional.
Answer» A. hidden knowledge.
218.

The process of finding the right formal representation of a certain body of knowledge in order to represent it in a knowledge-based system is __________.

A. re-engineering.
B. replication.
C. knowledge engineering.
D. reverse engineering.
Answer» C. knowledge engineering.
219.

The performance of _______ is better at problem-solving techniques.

A. neural network.
B. classification trees.
C. decision tree.
D. genetic knowledge.
Answer» D. genetic knowledge.
220.

The quality of output depends on ____________.

A. algorithm that is capable of learning rules.
B. algorithm that can handle large data sets.
C. algorithm that can learn incrementally.
D. algorithm that performs well.
Answer» A. algorithm that is capable of learning rules.
221.

In CAPITANS, the _________ algorithm is used to predict the pilots' behavior.

A. neural network.
B. genetic algorithm.
C. association.
D. decision tree.
Answer» B. genetic algorithm.
222.

OR methods deal with _______ type of data.

A. quantitative.
B. qualitative.
C. standard.
D. predict.
Answer» A. quantitative.
223.

Shannon's notion of the information content of a message is _______.

A. log(1/n) = log n.
B. log n = log(1/n).
C. log(1/n) = -log n.
D. log(-n) = log(1/n).
Answer» C. log(1/n) = -log n.
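The identity in this question can be checked numerically. The short sketch below assumes n equally likely messages and a base-2 logarithm (so the result is in bits); the function name is illustrative.

```python
import math

def information_content(n):
    """Bits needed to single out one of n equally likely messages."""
    probability = 1 / n
    # -log2(1/n) is the same quantity as log2(n).
    return -math.log2(probability)

bits_for_eight = information_content(8)
```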
224.

Which of the following features usually applies to data in a data warehouse?

A. Data are often deleted.
B. Most applications consist of transactions.
C. Data are rarely deleted.
D. Relatively few records are processed by applications.
Answer» C. Data are rarely deleted.
225.

Which of the following is true?

A. The data warehouse consists of data marts and operational data.
B. The data warehouse consists of data marts and application data.
C. The data warehouse is used as a source for the operational data.
D. The operational data are used as a source for the data warehouse.
Answer» D. The operational data are used as a source for the data warehouse.
226.

Which of the following best describes a data warehouse?

A. Can be updated by end users.
B. Contains numerous naming conventions and formats.
C. Organized around important subject areas.
D. Contains only current data.
Answer» C. Organized around important subject areas.
227.

Which of the following is an operational system?

A. A system that is used to run the business in real time and is based on historical data.
B. A system that is used to run the business in real time and is based on current data.
C. A system that is used to support decision making and is based on current data.
D. A system that is used to support decision making and is based on historical data.
Answer» B. A system that is used to run the business in real time and is based on current data.
228.

The generic two-level data warehouse architecture includes _______________.

A. at least one data mart.
B. data that can be extracted from numerous internal and external sources.
C. near real-time updates.
D. historic data.
Answer» B. data that can be extracted from numerous internal and external sources.
229.

Which of the following is reconciled data?

A. Current data intended to be the single source for all decision support systems
B. Data stored in the various operational systems throughout the organization.
C. Data stored in one operational system in the organization.
D. Data that has been selected and formatted for end-user support applications.
Answer» A. Current data intended to be the single source for all decision support systems
230.

________analysis divides data into groups that are meaningful, useful, or both.

A. Cluster.
B. Association.
C. Classification.
D. Relation.
Answer» A. Cluster.
231.

Which of the following is an extract process?

A. Capturing all of the data contained in various operational systems.
B. Capturing a subset of the data contained in various operational systems.
C. Capturing all of the data contained in various decision support systems.
D. Capturing a subset of the data contained in various decision support systems.
Answer» B. Capturing a subset of the data contained in various operational systems.
232.

Which of the following is not a type of clustering?

A. K-means.
B. Hierarchical.
C. Partitional.
D. Splitting.
Answer» D. Splitting.
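K-means, listed among the options above, is a partitional method. The following is a minimal one-dimensional sketch, assuming fixed starting centroids and a fixed number of iterations; the data and the function name are invented for illustration.

```python
def k_means_1d(points, centroids, iterations=10):
    """Alternate between assigning points to the nearest centroid and
    moving each centroid to the mean of its assigned points."""
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters

points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
centroids, clusters = k_means_1d(points, centroids=[1.0, 12.0])
```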
233.

The load and index is __________.

A. a process to upgrade the quality of data before it is moved into a data warehouse.
B. a process to upgrade the quality of data after it is moved into a data warehouse.
C. a process to reject data from the data warehouse and to create the necessary indexes.
D. a process to load the data in the data warehouse and to create the necessary indexes.
Answer» D. a process to load the data in the data warehouse and to create the necessary indexes.
234.

Data Transformation includes____________.

A. a process to change data from a detailed level to a summary level.
B. a process to change data from a summary level to a detailed level.
C. joining data from one source into various sources of data.
D. separating data from one source into various sources of data.
Answer» A. a process to change data from a detailed level to a summary level.
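Changing data from a detailed level to a summary level is essentially aggregation. A small sketch, assuming hypothetical sales rows and a per-product total as the summary:

```python
from collections import defaultdict

# Detailed level: one row per individual sale (made-up data).
detailed_sales = [
    {"day": "Mon", "product": "pen", "amount": 5.0},
    {"day": "Mon", "product": "ink", "amount": 7.5},
    {"day": "Tue", "product": "pen", "amount": 2.5},
]

def summarize_by_product(rows):
    """Summary level: one total per product."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["product"]] += row["amount"]
    return dict(totals)

summary = summarize_by_product(detailed_sales)
```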
235.

The _____________ is called a multi-field transformation.

A. conversion of data from one field into multiple fields.
B. conversion of data from multiple fields into one field.
C. conversion of data from double fields into multiple fields.
D. conversion of data from one field to one field.
Answer» A. conversion of data from one field into multiple fields.
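As an illustration of converting one source field into multiple target fields, the sketch below splits a single name field in two. The record layout and field names are hypothetical.

```python
def split_full_name(record):
    """Turn the single 'full_name' field into 'first_name' and 'last_name'."""
    first, _, last = record["full_name"].partition(" ")
    return {"first_name": first, "last_name": last}

source_record = {"full_name": "Ada Lovelace"}
target_record = split_full_name(source_record)
```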
236.

The type of relationship in star schema is ____________.

A. many-to-many.
B. one-to-one.
C. one-to-many.
D. many-to-one.
Answer» C. one-to-many.
237.

Fact tables are _____________.

A. completely denormalized.
B. partially denormalized.
C. completely normalized.
D. partially normalized.
Answer» C. completely normalized.
238.

Which of the following table types belongs to the snowflake schema?

A. Free.
B. Dimension.
C. Double.
D. Replicator.
Answer» B. Dimension.
239.

The goal of data mining is _________.

A. to explain some observed event or condition.
B. to confirm that data exists.
C. to analyze data for expected relationships.
D. to create a new data warehouse.
Answer» A. to explain some observed event or condition.
240.

Which of the following information systems are used in the daily running of the business?

A. Operational planning systems.
B. Transaction processing systems (TPS).
C. Process control systems.
D. Office automation systems (OAS).
Answer» A. Operational planning systems.
241.

Which of the following is not one of the three Cs describing groupware?

A. Collaboration.
B. Communication.
C. Co-ordination.
D. Compliance.
Answer» D. Compliance.
242.

Materials Requirements Planning (MRP) software is an example of an information systems application in which of the following areas?

A. Operations Management.
B. Accounting.
C. Marketing.
D. Human Resource.
Answer» A. Operations Management.
243.

Which of the following is not one of the three main components in a decision support system?

A. Model.
B. Communications.
C. Data.
D. Dialogue.
Answer» B. Communications.
244.

Business intelligence and data warehousing are used for which of the following?

A. Forecasting.
B. Data Mining.
C. Analysis of large volumes of product sales data.
D. All of the above.
Answer» D. All of the above.
245.

Which of the following is true about ERP software?

A. Simplified support and maintenance through a supplier.
B. Extensive skills available for development.
C. Supplier costs kept low.
D. Always adaptable to how the business works.
Answer» A. Simplified support and maintenance through a supplier.
246.

Which of the following is an EIS?

A. Electronic information system.
B. Executive information system.
C. Extended information system.
D. Electronic interface system.
Answer» B. Executive information system.
247.

What is the role of document image processing (DIP) systems?

A. Cash flow forecasting.
B. Inventory control.
C. Conversion into digital format.
D. Producing customer invoices.
Answer» C. Conversion into digital format.
248.

Which of the following uses a series of logically related two-dimensional tables or files to store information in the form of a database?

A. Database.
B. Database management system.
C. Data warehouse.
D. None of the above.
Answer» D. None of the above.
249.

What DBMS component contains facilities to help you develop transaction-intensive applications?

A. DBMS engine.
B. Data definition subsystem.
C. Application generation subsystem.
D. Data administration subsystem.
Answer» C. Application generation subsystem.
250.

Which of the following is a data manipulation tool?

A. File generators.
B. Query by example tool.
C. Structured question language.
D. Structured query language.
Answer» B. Query by example tool.
251.

The data administration subsystem helps you perform all of the following, except______.

A. backups and recovery.
B. query optimization.
C. security management.
D. create, change, and delete information.
Answer» D. create, change, and delete information.
252.

Which data administration subsystem periodically backs up information contained in a database?

A. Concurrency control facilities.
B. Reorganization facilities.
C. Backup and recovery facilities.
D. Security management facilities.
Answer» C. Backup and recovery facilities.
253.

Which of the following is true of three-tier data warehouses?

A. Once created, the data marts will keep on being updated from the data warehouse at periodic times.
B. Once created, the data marts will directly receive their new data from the operational databases.
C. The data marts are different groups of tables in the data warehouse.
D. A data mart becomes a data warehouse when it reaches a critical size.
Answer» A. Once created, the data marts will keep on being updated from the data warehouse at periodic times.
254.

Which of the given technologies is not well suited for data mining?

A. Expert system technology.
B. Data visualization.
C. Technology limited to specific data types such as numeric data types.
D. Parallel architecture.
Answer» C. Technology limited to specific data types such as numeric data types.
255.

What is true about the multidimensional model?

A. It typically requires less disk storage.
B. It typically requires more disk storage.
C. Typical business queries requiring aggregate functions take more time.
D. Typical business queries requiring aggregate functions take more time.
Answer» B. It typically requires more disk storage.
256.

The most common source of change data in refreshing a data warehouse is__________.

A. queryable change data.
B. cooperative change data.
C. logged change data.
D. snapshot change data.
Answer» A. queryable change data.
257.

Which of the following statements is not true about refreshing a data warehouse?

A. It is a process of managing timing differences between the updating of data sources and the related data warehouse objects.
B. Updates to dimension tables may occur at different times than the fact table.
C. The data warehouse administrator has more control over the load time lag than the valid time lag.
D. None of the above.
Answer» D. None of the above.
258.

The ________ is responsible for running queries and reports against data warehouse tables.

A. hardware.
B. software.
C. end users.
D. middleware.
Answer» C. end users.
259.

Query tool is meant for __________.

A. data acquisition.
B. information delivery.
C. information exchange.
D. communication.
Answer» A. data acquisition.
260.

Which of the following functions involves data cleaning, data standardizing, and summarizing?

A. Storing data.
B. Transforming data.
C. Data acquisition.
D. Data Access.
Answer» B. Transforming data.
261.

Which of the following clustering analysis methods uses a multi-resolution approach?

A. STUNT.
B. OPTICS.
C. CLIQUE.
D. WaveCluster.
Answer» D. WaveCluster.
262.

Which of the following clustering methods computes an augmented cluster ordering?

A. OPTICS.
B. CLIQUE.
C. STING.
D. CLUSTER.
Answer» A. OPTICS.
263.

Data compression is to compress the given data by encoding in terms of _________.

A. bytes.
B. bits.
C. cluster.
D. group.
Answer» C. cluster.
264.

Which of the following features is supported by the physical design of a data warehouse?

A. Support backup and recovery.
B. Planning.
C. Analysis.
D. Coding.
Answer» A. Support backup and recovery.
265.

Classification rules are extracted from__________.

A. root node.
B. decision tree.
C. siblings.
D. branches.
Answer» B. decision tree.
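One common way to read classification rules off a decision tree is to turn every root-to-leaf path into an IF-THEN rule. The sketch below assumes a toy tree stored as nested dictionaries; the tree contents and the function name are illustrative only.

```python
# Internal nodes test one attribute; leaves are class labels (plain strings).
tree = {
    "attribute": "outlook",
    "branches": {
        "sunny": {
            "attribute": "windy",
            "branches": {"yes": "stay in", "no": "play"},
        },
        "rainy": "stay in",
    },
}

def extract_rules(node, conditions=()):
    """Collect one IF-THEN rule per root-to-leaf path."""
    if isinstance(node, str):  # reached a leaf: emit the finished rule
        premise = " AND ".join(f"{attr} = {value}" for attr, value in conditions)
        return [f"IF {premise} THEN {node}"]
    rules = []
    for value, child in node["branches"].items():
        rules.extend(extract_rules(child, conditions + ((node["attribute"], value),)))
    return rules

rules = extract_rules(tree)
```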
266.

Dimensionality reduction reduces the data set size by removing _________.

A. relevant attributes.
B. irrelevant attributes.
C. derived attributes.
D. composite attributes.
Answer» B. irrelevant attributes.
267.

___________is a method of incremental conceptual clustering.

A. CORBA.
B. OLAP.
C. COBWEB.
D. STING.
Answer» C. COBWEB.
268.

The assumption that the effect of one attribute value on a given class is independent of the values of the other attributes is called __________.

A. value independence.
B. class conditional independence.
C. conditional independence.
D. unconditional.
Answer» B. class conditional independence.
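The independence assumption described in this question is the one naive Bayes classifiers rely on: the per-attribute likelihoods are simply multiplied within each class. A minimal sketch with made-up training data and no smoothing:

```python
# (features, class label) pairs; purely illustrative data.
training = [
    ({"outlook": "sunny", "windy": "no"}, "play"),
    ({"outlook": "sunny", "windy": "yes"}, "play"),
    ({"outlook": "rainy", "windy": "yes"}, "stay"),
    ({"outlook": "rainy", "windy": "no"}, "stay"),
]

def score(features, label):
    """P(label) times the product of P(attribute = value | label)."""
    rows = [f for f, l in training if l == label]
    prior = len(rows) / len(training)
    likelihood = 1.0
    for attr, value in features.items():
        matches = sum(1 for f in rows if f[attr] == value)
        # Attributes are treated as independent given the class.
        likelihood *= matches / len(rows)
    return prior * likelihood

def classify(features):
    labels = sorted({l for _, l in training})
    return max(labels, key=lambda l: score(features, l))

query = {"outlook": "sunny", "windy": "no"}
predicted = classify(query)
```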
269.

Which of the following are special programs that are stored in the database and fired when a certain predefined action occurs?

A. Snapshots.
B. Relations.
C. Triggers.
D. Cursors.
Answer» C. Triggers.
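To make the idea of a trigger concrete, here is a small sketch using Python's built-in sqlite3 module with an in-memory database. The table and trigger names are hypothetical; the trigger fires automatically after each insert into the orders table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL);
    CREATE TABLE audit_log (order_id INTEGER, note TEXT);

    -- Stored in the database; fired when the predefined action (INSERT) occurs.
    CREATE TRIGGER log_new_order AFTER INSERT ON orders
    BEGIN
        INSERT INTO audit_log (order_id, note) VALUES (NEW.id, 'order inserted');
    END;
""")

# The application only inserts an order; the audit row is written by the trigger.
conn.execute("INSERT INTO orders (amount) VALUES (?)", (19.99,))
audit_rows = conn.execute("SELECT order_id, note FROM audit_log").fetchall()
```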
270.

_________ refers to extracting knowledge from large amounts of data.

A. data abstraction.
B. data warehouse.
C. database.
D. data mining.
Answer» D. data mining.
271.

___________ is a knowledge discovery process.

A. Data cleaning.
B. Data warehousing.
C. Data mining.
D. Data transformation.
Answer» A. Data cleaning.
272.

OLAP is used for __________.

A. online application processing.
B. online analytical processing.
C. online aptitude processing.
D. online administration and processing.
Answer» B. online analytical processing.
273.

Maintenance of cache consistency is the limitation of _____________.

A. NUMA.
B. UNAM.
C. MPP.
D. PMP.
Answer» C. MPP.
274.

Data warehouse architecture is based on _______________.

A. DBMS.
B. RDBMS.
C. Sybase.
D. SQL Server.
Answer» B. RDBMS.
275.

Source data for the warehouse comes from ___________.

A. ODS.
B. TDS.
C. MDDB.
D. ORDBMS.
Answer» A. ODS.
276.

The ___________is a data transformation process.

A. comparison.
B. projection.
C. selection.
D. filtering.
Answer» D. filtering.
277.

Technology area associated with CRM is _____________.

A. specialization.
B. generalization.
C. personalization.
D. summarization.
Answer» C. personalization.
278.

SMP stands for __________.

A. symmetric multiprocessor.
B. symmetric multiprogramming.
C. symmetric meta programming.
D. symmetric microprogramming.
Answer» A. symmetric multiprocessor.
279.

The __________is designed to overcome any limitations placed on the warehouse by the nature of the relational data model.

A. operational database.
B. relational database.
C. multidimensional database.
D. data repository.
Answer» C. multidimensional database.
280.

MDDB stands for _________.

A. multiple data doubling.
B. multidimensional databases.
C. multiple double dimension.
D. multi-dimension doubling.
Answer» B. multidimensional databases.
281.

__________ is data about data.

A. Metadata.
B. Microdata.
C. Minidata
D. Multidata.
Answer» A. Metadata.
282.

The ___________ is an important functional component of the metadata.

A. digital directory.
B. repository.
C. information directory.
D. data dictionary.
Answer» C. information directory.
283.

EIS stands for __________.

A. extended interface system.
B. executive interface system.
C. executive information system.
D. extendable information system.
Answer» C. executive information system.
284.

____________ is data collected from natural systems.

A. MRI scan.
B. ODS data.
C. Statistical data.
D. Historical data.
Answer» A. MRI scan.
285.

Which one of the following features is provided by an effective data visualization tool?

A. Capability to update data.
B. Capability to delete or alter data.
C. Capability to append data.
D. Capability to compare data.
Answer» D. Capability to compare data.
286.

The __________ are inexpensive desktop tools designed for end users.

A. production tools.
B. report writers.
C. query tools.
D. application tools.
Answer» B. report writers.
287.

___________ is an example of an application development environment.

A. Visual Basic.
B. Oracle.
C. Sybase.
D. SQL Server.
Answer» A. Visual Basic.
288.

Which of the following is not an issue related to concept learning?

A. Supervised learning.
B. Unsupervised learning.
C. Self learning.
D. Concept learning.
Answer» D. Concept learning.
289.

___________are some popular OLAP tools.

A. Metacube, Informix.
B. Oracle Express, Essbase.
C. HOLAP.
D. MOLAP.
Answer» A. Metacube, Informix.
290.

Capability of data mining is to build __________ models.

A. retrospective.
B. interrogative.
C. predictive.
D. imperative.
Answer» C. predictive.
291.

_________ is the process of determining the preference of the majority of customers.

A. association.
B. preferencing.
C. segmentation.
D. classification.
Answer» B. preferencing.
292.

Which of the following is a process to humanize the mass of data?

A. Visualizing.
B. Segmentation.
C. Discovery.
D. Correction.
Answer» A. Visualizing.
293.

Strategic value of data mining is___________.

A. cost-sensitive.
B. work-sensitive.
C. time-sensitive.
D. technical-sensitive.
Answer» C. time-sensitive.
294.

Which of the following is a data store that is a subsidiary of a data warehouse of integrated data?

A. Data mart.
B. Data store.
C. Database.
D. Data house.
Answer» A. Data mart.
295.

_________proposed the approach for data integration issues.

A. Ralph Campbell.
B. Ralph Kimball.
C. John Raphlin.
D. James Gosling.
Answer» B. Ralph Kimball.
296.

The terms equality and roll up are associated with___________.

A. OLAP.
B. visualization.
C. data mart.
D. decision tree.
Answer» C. data mart.
297.

Exception reporting in data warehousing is otherwise called _______.

A. exception.
B. alerts.
C. errors.
D. bugs.
Answer» B. alerts.
298.

_________ is a metadata repository.

A. Prism solution directory manager.
B. CORBA.
C. STUNT.
D. COBWEB.
Answer» A. Prism solution directory manager.
299.

Which year marked the beginning of the new generation of machine learning?

A. 1880.
B. 1980.
C. 1970.
D. 1960.
Answer» C. 1970.
300.

The __________ is an expensive process in building an expert system.

A. analysis.
B. study.
C. design.
D. information collection.
Answer» D. information collection.