88
76.7k

320+ Data Mining Solved MCQs

These multiple-choice questions (MCQs) are designed to enhance your knowledge and understanding in the following areas: Bachelor of Science in Computer Science TY (BSc CS) , Master of Science in Computer Science (MSc CS) , Bachelor of Science in Computer Science (BSc CS) .

101.

___________ percentage of the interesting information can be obtained by using SQL.

A. 80
B. 70
C. 40
D. 50
Answer» A. 80
102.

________ is the technique which is used for discovering patterns in dataset at the beginning of data mining process.

A. Kohenon map.
B. Visualization.
C. OLAP.
D. SQL.
Answer» B. Visualization.
103.

In K-nearest neighbor algorithm K stands for ________.

A. number of neighbors that are investigated.
B. number of iterations.
C. number of total records.
D. random number.
Answer» A. number of neighbors that are investigated.
104.

The complexity of data mining algorithm is represented by ________.

A. log n.
B. 2n log n.
C. n log n.
D. 2 log n.
Answer» C. n log n.
105.

Genetic algorithm was proposed by _______.

A. John Holland.
B. Johnson.
C. Watson.
D. Kohenon.
Answer» A. John Holland.
106.

________ is the first stage in genetic algorithm.

A. Evaluation of each string.
B. Selection of string.
C. Creation of population of string.
D. Genetic manipulation.
Answer» C. Creation of population of string.
107.

The _________ is one of genetic operators that are used to recombine the population of genetic material.

A. genetic operator.
B. mutation operator.
C. cross over operator.
D. encoding operator.
Answer» A. genetic operator.
108.

_______ is the heart of knowledge discovery in database process.

A. Selection.
B. Data ware house.
C. Data mining.
D. Creative coding.
Answer» D. Creative coding.
109.

______ is a planning optimization application written for KLM

A. PILOTS.
B. CAPTAINS.
C. CUSTOMERS.
D. AIRLINES.
Answer» B. CAPTAINS.
110.

EIS stands for _________.

A. Executive Information System.
B. Exchange of Information System.
C. Extra Information System.
D. Extended Information system.
Answer» A. Executive Information System.
111.

Foreign key constraints are also referred as _______.

A. consistency constraints.
B. referential integrity.
C. conditional integrity.
D. domain constraints.
Answer» B. referential integrity.
112.

The set of attribute in a database that refers to data in another table is called ______.

A. primary key.
B. candidate key.
C. foreign key.
D. super key.
Answer» C. foreign key.
113.

The distance between two points that is calculated using Pythagoras theorem is _________.

A. cartesian distance.
B. eucledian distance.
C. extendable distance.
D. heuristic distance.
Answer» B. eucledian distance.
114.

A database containing volatile data used for daily operation of an organization is ______.

A. historic data.
B. metadata.
C. knowledge.
D. operational data.
Answer» D. operational data.
115.

The system that can be used without knowledge of internal operation _______.

A. black box.
B. white box.
C. case based learning.
D. bias.
Answer» A. black box.
116.

______ is the relationship between compressibility and learnability.

A. Maximum description length principle.
B. Minimum description length principle.
C. Kolmogorov complexity.
D. Voronoi principle.
Answer» B. Minimum description length principle.
117.

In KDD and data mining, noise is referred to as ________.

A. repeated data.
B. complex data.
C. meta data.
D. random errors in database.
Answer» D. random errors in database.
118.

DSS stands for _______.

A. Deciding Support System.
B. Decision Support System.
C. Decision Software System.
D. Decision System of System.
Answer» B. Decision Support System.
119.

Data mining algorithms require ___________

A. efficient sampling method.
B. storage of intermediate results.
C. capacity to handle large amounts of data.
D. All of the above.
Answer» D. All of the above.
120.

The algorithm that need to access a table several times during execution is_______.

A. n-table scan algorithm.
B. zoom scan algorithm.
C. hybrid algorithm.
D. nearest neighbor search.
Answer» A. n-table scan algorithm.
121.

A coding operation in which an attribute with cardinality n is replaced by n binary attributes is called as ______.

A. falsification of table.
B. enrichment of table.
C. flattening of table.
D. fuzzification of table.
Answer» C. flattening of table.
122.

The un-normalized relation containing all attributes that exist in database is ______.

A. actual relation.
B. transparent relation.
C. verified relation.
D. universal relation.
Answer» D. universal relation.
123.

The technique of learning by generalizing from examples is ________.

A. incremental learning.
B. inductive learning.
C. hybrid learning.
D. generalized learning.
Answer» B. inductive learning.
124.

The ever increasing amount of data is compared to that of infinite library by Jorge Louis Borges in his short stories namely _________.

A. the library of Louis.
B. the library of Borges.
C. the library of Babel.
D. the library of Boulevard.
Answer» C. the library of Babel.
125.

______ itself has become a production factor of importance.

A. Data.
B. Information.
C. Program.
D. Algorithm.
Answer» B. Information.
126.

The _______ plays an important role in artificial intelligence.

A. programming skill.
B. scheduling.
C. planning.
D. learning capabilities.
Answer» D. learning capabilities.
127.

Knowledge discovery in database refers to _____.

A. whole process of extraction of knowledge from data.
B. selection of data.
C. coding.
D. cleaning the data.
Answer» A. whole process of extraction of knowledge from data.
128.

Data mining is used to refer ______ stage in knowledge discovery in database.

A. selection.
B. retrieving.
C. discovery.
D. coding.
Answer» C. discovery.
129.

Query tools and data mining tools are _______.

A. same.
B. different.
C. complementary.
D. standard.
Answer» C. complementary.
130.

In genetic algorithm the problem is considered in terms of _________.

A. values.
B. points in multidimensional space.
C. node.
D. strings of characters.
Answer» D. strings of characters.
131.

In UK,_______ has applied data mining techniques to analyze viewing figures. a. a press .

A. press
B. BBC
C. CNN
D. NDT
Answer» B. BBC
132.

In K- nearest neighbor the input is translated to __________.

A. values
B. points in multidimensional space
C. strings of characters
D. nodes
Answer» B. points in multidimensional space
133.

In machine learning ________ phase try to find the patterns from observations.

A. observation
B. theory
C. analysis
D. prediction
Answer» C. analysis
134.

__________________refers to the process of deriving high-quality information from text.

A. Text Mining.
B. Image Mining.
C. Database Mining.
D. Multimedia Mining.
Answer» A. Text Mining.
135.

The process of selecting good hypothesis and improving the theory based on this is called _______.

A. heuristic search
B. hill climbing algorithm.
C. incremental search.
D. apriori algorithm
Answer» B. hill climbing algorithm.
136.

_____________ is the application of data mining techniques to discover patterns from the Web.

A. Text Mining.
B. Multimedia Mining.
C. Web Mining.
D. Link Mining.
Answer» C. Web Mining.
137.

It is important to know the complexity of the _______ before developing any machine learning algorithm.

A. data
B. algorithm
C. search space
D. learning
Answer» C. search space
138.

Information content is closely related to ______ and transparency.

A. algorithm.
B. search space.
C. learning.
D. statistical significance.
Answer» D. statistical significance.
139.

The ________ is used to express the hypothesis describing the concept.

A. computer language.
B. algorithm.
C. definition.
D. theory
Answer» A. computer language.
140.

A definition of a concept is complete if it recognizes _________.

A. all the information.
B. all the instances of a concept.
C. only positive examples.
D. negative examples.
Answer» B. all the instances of a concept.
141.

The results of machine learning algorithms are always have to be checked for their _________.

A. observations.
B. calculations
C. programs.
D. statistical relevance.
Answer» D. statistical relevance.
142.

A ________ is necessary condition for KDDs effective implement.

A. data set.
B. database.
C. data warehouse.
D. data.
Answer» C. data warehouse.
143.

The first international KDD conference was held in the year ________.

A. 1995.
B. 1994.
C. 1993.
D. 1992.
Answer» A. 1995.
144.

AI stands for ____.

A. art of interest.
B. artificial interest.
C. art of intelligence.
D. artificial intelligence.
Answer» D. artificial intelligence.
145.

KDD is a ________.

A. new technology that is use to store data.
B. multidisciplinary field of research.
C. database technology.
D. expert system.
Answer» B. multidisciplinary field of research.
146.

______ could generate rule automatically.

A. KDD.
B. machine learning.
C. artificial intelligence.
D. expert system.
Answer» B. machine learning.
147.

Intelligent miner is a mining tool from _______.

A. Clementine.
B. living stones.
C. IBM.
D. Wipro.
Answer» C. IBM.
148.

The organization such as ______ is in USA.

A. AT & T.
B. AD & T.
C. AA & T.
D. AT & D.
Answer» A. AT & T.
149.

________ is a mining tool from integral solutions.

A. WEKA
B. web miner.
C. rapid miner.
D. clementine.
Answer» D. clementine.
150.

________ % of KDD is about preparing data.

A. 60.
B. 70
C. 80
D. 90
Answer» C. 80

Done Studing? Take A Test.

Great job completing your study session! Now it's time to put your knowledge to the test. Challenge yourself, see how much you've learned, and identify areas for improvement. Don’t worry, this is all part of the journey to mastery. Ready for the next step? Take a quiz to solidify what you've just studied.