320+ Data Mining Solved MCQs

These multiple-choice questions (MCQs) are designed to enhance your knowledge and understanding in the following areas: Bachelor of Science in Computer Science TY (BSc CS) , Master of Science in Computer Science (MSc CS) , Bachelor of Science in Computer Science (BSc CS) .

Take a Test

101.	___________ percentage of the interesting information can be obtained by using SQL.
A.	80
B.	70
C.	40
D.	50
Answer» A. 80

102.	________ is the technique which is used for discovering patterns in dataset at the beginning of data mining process.
A.	Kohenon map.
B.	Visualization.
C.	OLAP.
D.	SQL.
Answer» B. Visualization.

103.	In K-nearest neighbor algorithm K stands for ________.
A.	number of neighbors that are investigated.
B.	number of iterations.
C.	number of total records.
D.	random number.
Answer» A. number of neighbors that are investigated.

104.	The complexity of data mining algorithm is represented by ________.
A.	log n.
B.	2n log n.
C.	n log n.
D.	2 log n.
Answer» C. n log n.

105.	Genetic algorithm was proposed by _______.
A.	John Holland.
B.	Johnson.
C.	Watson.
D.	Kohenon.
Answer» A. John Holland.

106.	________ is the first stage in genetic algorithm.
A.	Evaluation of each string.
B.	Selection of string.
C.	Creation of population of string.
D.	Genetic manipulation.
Answer» C. Creation of population of string.

107.	The _________ is one of genetic operators that are used to recombine the population of genetic material.
A.	genetic operator.
B.	mutation operator.
C.	cross over operator.
D.	encoding operator.
Answer» A. genetic operator.

108.	_______ is the heart of knowledge discovery in database process.
A.	Selection.
B.	Data ware house.
C.	Data mining.
D.	Creative coding.
Answer» D. Creative coding.

109.	______ is a planning optimization application written for KLM
A.	PILOTS.
B.	CAPTAINS.
C.	CUSTOMERS.
D.	AIRLINES.
Answer» B. CAPTAINS.

110.	EIS stands for _________.
A.	Executive Information System.
B.	Exchange of Information System.
C.	Extra Information System.
D.	Extended Information system.
Answer» A. Executive Information System.

111.	Foreign key constraints are also referred as _______.
A.	consistency constraints.
B.	referential integrity.
C.	conditional integrity.
D.	domain constraints.
Answer» B. referential integrity.

112.	The set of attribute in a database that refers to data in another table is called ______.
A.	primary key.
B.	candidate key.
C.	foreign key.
D.	super key.
Answer» C. foreign key.

113.	The distance between two points that is calculated using Pythagoras theorem is _________.
A.	cartesian distance.
B.	eucledian distance.
C.	extendable distance.
D.	heuristic distance.
Answer» B. eucledian distance.

114.	A database containing volatile data used for daily operation of an organization is ______.
A.	historic data.
B.	metadata.
C.	knowledge.
D.	operational data.
Answer» D. operational data.

115.	The system that can be used without knowledge of internal operation _______.
A.	black box.
B.	white box.
C.	case based learning.
D.	bias.
Answer» A. black box.

116.	______ is the relationship between compressibility and learnability.
A.	Maximum description length principle.
B.	Minimum description length principle.
C.	Kolmogorov complexity.
D.	Voronoi principle.
Answer» B. Minimum description length principle.

117.	In KDD and data mining, noise is referred to as ________.
A.	repeated data.
B.	complex data.
C.	meta data.
D.	random errors in database.
Answer» D. random errors in database.

118.	DSS stands for _______.
A.	Deciding Support System.
B.	Decision Support System.
C.	Decision Software System.
D.	Decision System of System.
Answer» B. Decision Support System.

119.	Data mining algorithms require ___________
A.	efficient sampling method.
B.	storage of intermediate results.
C.	capacity to handle large amounts of data.
D.	All of the above.
Answer» D. All of the above.

120.	The algorithm that need to access a table several times during execution is_______.
A.	n-table scan algorithm.
B.	zoom scan algorithm.
C.	hybrid algorithm.
D.	nearest neighbor search.
Answer» A. n-table scan algorithm.

121.	A coding operation in which an attribute with cardinality n is replaced by n binary attributes is called as ______.
A.	falsification of table.
B.	enrichment of table.
C.	flattening of table.
D.	fuzzification of table.
Answer» C. flattening of table.

122.	The un-normalized relation containing all attributes that exist in database is ______.
A.	actual relation.
B.	transparent relation.
C.	verified relation.
D.	universal relation.
Answer» D. universal relation.

123.	The technique of learning by generalizing from examples is ________.
A.	incremental learning.
B.	inductive learning.
C.	hybrid learning.
D.	generalized learning.
Answer» B. inductive learning.

124.	The ever increasing amount of data is compared to that of infinite library by Jorge Louis Borges in his short stories namely _________.
A.	the library of Louis.
B.	the library of Borges.
C.	the library of Babel.
D.	the library of Boulevard.
Answer» C. the library of Babel.

125.	______ itself has become a production factor of importance.
A.	Data.
B.	Information.
C.	Program.
D.	Algorithm.
Answer» B. Information.

320+ Data Mining Solved MCQs

___________ percentage of the interesting information can be obtained by using SQL.

________ is the technique which is used for discovering patterns in dataset at the beginning of data mining process.

In K-nearest neighbor algorithm K stands for ________.

The complexity of data mining algorithm is represented by ________.

Genetic algorithm was proposed by _______.

________ is the first stage in genetic algorithm.

The _________ is one of genetic operators that are used to recombine the population of genetic material.

_______ is the heart of knowledge discovery in database process.

______ is a planning optimization application written for KLM

EIS stands for _________.

Foreign key constraints are also referred as _______.

The set of attribute in a database that refers to data in another table is called ______.

The distance between two points that is calculated using Pythagoras theorem is _________.

A database containing volatile data used for daily operation of an organization is ______.

The system that can be used without knowledge of internal operation _______.

______ is the relationship between compressibility and learnability.

In KDD and data mining, noise is referred to as ________.

DSS stands for _______.

Data mining algorithms require ___________

The algorithm that need to access a table several times during execution is_______.

A coding operation in which an attribute with cardinality n is replaced by n binary attributes is called as ______.

The un-normalized relation containing all attributes that exist in database is ______.

The technique of learning by generalizing from examples is ________.

The ever increasing amount of data is compared to that of infinite library by Jorge Louis Borges in his short stories namely _________.

______ itself has become a production factor of importance.

The _______ plays an important role in artificial intelligence.

Knowledge discovery in database refers to _____.

Data mining is used to refer ______ stage in knowledge discovery in database.

Query tools and data mining tools are _______.

In genetic algorithm the problem is considered in terms of _________.

In UK,_______ has applied data mining techniques to analyze viewing figures. a. a press .

In K- nearest neighbor the input is translated to __________.

In machine learning ________ phase try to find the patterns from observations.

__________________refers to the process of deriving high-quality information from text.

The process of selecting good hypothesis and improving the theory based on this is called _______.

_____________ is the application of data mining techniques to discover patterns from the Web.

It is important to know the complexity of the _______ before developing any machine learning algorithm.

Information content is closely related to ______ and transparency.

The ________ is used to express the hypothesis describing the concept.

A definition of a concept is complete if it recognizes _________.

The results of machine learning algorithms are always have to be checked for their _________.

A ________ is necessary condition for KDDs effective implement.

The first international KDD conference was held in the year ________.

AI stands for ____.

KDD is a ________.

______ could generate rule automatically.

Intelligent miner is a mining tool from _______.

The organization such as ______ is in USA.

________ is a mining tool from integral solutions.

________ % of KDD is about preparing data.

126.	The _______ plays an important role in artificial intelligence.
A.	programming skill.
B.	scheduling.
C.	planning.
D.	learning capabilities.
Answer» D. learning capabilities.

127.	Knowledge discovery in database refers to _____.
A.	whole process of extraction of knowledge from data.
B.	selection of data.
C.	coding.
D.	cleaning the data.
Answer» A. whole process of extraction of knowledge from data.

128.	Data mining is used to refer ______ stage in knowledge discovery in database.
A.	selection.
B.	retrieving.
C.	discovery.
D.	coding.
Answer» C. discovery.

129.	Query tools and data mining tools are _______.
A.	same.
B.	different.
C.	complementary.
D.	standard.
Answer» C. complementary.

130.	In genetic algorithm the problem is considered in terms of _________.
A.	values.
B.	points in multidimensional space.
C.	node.
D.	strings of characters.
Answer» D. strings of characters.

131.	In UK,_______ has applied data mining techniques to analyze viewing figures. a. a press .
A.	press
B.	BBC
C.	CNN
D.	NDT
Answer» B. BBC

132.	In K- nearest neighbor the input is translated to __________.
A.	values
B.	points in multidimensional space
C.	strings of characters
D.	nodes
Answer» B. points in multidimensional space

133.	In machine learning ________ phase try to find the patterns from observations.
A.	observation
B.	theory
C.	analysis
D.	prediction
Answer» C. analysis

134.	__________________refers to the process of deriving high-quality information from text.
A.	Text Mining.
B.	Image Mining.
C.	Database Mining.
D.	Multimedia Mining.
Answer» A. Text Mining.

135.	The process of selecting good hypothesis and improving the theory based on this is called _______.
A.	heuristic search
B.	hill climbing algorithm.
C.	incremental search.
D.	apriori algorithm
Answer» B. hill climbing algorithm.