320+ Data Mining Solved MCQs

101.

___________ percentage of the interesting information can be obtained by using SQL.

A. 80
B. 70
C. 40
D. 50
Answer» A. 80
102.

________ is the technique which is used for discovering patterns in dataset at the beginning of data mining process.

A. Kohenon map.
B. Visualization.
C. OLAP.
D. SQL.
Answer» B. Visualization.
103.

In K-nearest neighbor algorithm K stands for ________.

A. number of neighbors that are investigated.
B. number of iterations.
C. number of total records.
D. random number.
Answer» A. number of neighbors that are investigated.
104.

The complexity of data mining algorithm is represented by ________.

A. log n.
B. 2n log n.
C. n log n.
D. 2 log n.
Answer» C. n log n.
105.

Genetic algorithm was proposed by _______.

A. John Holland.
B. Johnson.
C. Watson.
D. Kohenon.
Answer» A. John Holland.
106.

________ is the first stage in genetic algorithm.

A. Evaluation of each string.
B. Selection of string.
C. Creation of population of string.
D. Genetic manipulation.
Answer» C. Creation of population of string.
107.

The _________ is one of genetic operators that are used to recombine the population of genetic material.

A. genetic operator.
B. mutation operator.
C. cross over operator.
D. encoding operator.
Answer» A. genetic operator.
108.

_______ is the heart of knowledge discovery in database process.

A. Selection.
B. Data ware house.
C. Data mining.
D. Creative coding.
Answer» D. Creative coding.
109.

______ is a planning optimization application written for KLM

A. PILOTS.
B. CAPTAINS.
C. CUSTOMERS.
D. AIRLINES.
Answer» B. CAPTAINS.
110.

EIS stands for _________.

A. Executive Information System.
B. Exchange of Information System.
C. Extra Information System.
D. Extended Information system.
Answer» A. Executive Information System.
111.

Foreign key constraints are also referred as _______.

A. consistency constraints.
B. referential integrity.
C. conditional integrity.
D. domain constraints.
Answer» B. referential integrity.
112.

The set of attribute in a database that refers to data in another table is called ______.

A. primary key.
B. candidate key.
C. foreign key.
D. super key.
Answer» C. foreign key.
113.

The distance between two points that is calculated using Pythagoras theorem is _________.

A. cartesian distance.
B. eucledian distance.
C. extendable distance.
D. heuristic distance.
Answer» B. eucledian distance.
114.

A database containing volatile data used for daily operation of an organization is ______.

A. historic data.
B. metadata.
C. knowledge.
D. operational data.
Answer» D. operational data.
115.

The system that can be used without knowledge of internal operation _______.

A. black box.
B. white box.
C. case based learning.
D. bias.
Answer» A. black box.
116.

______ is the relationship between compressibility and learnability.

A. Maximum description length principle.
B. Minimum description length principle.
C. Kolmogorov complexity.
D. Voronoi principle.
Answer» B. Minimum description length principle.
117.

In KDD and data mining, noise is referred to as ________.

A. repeated data.
B. complex data.
C. meta data.
D. random errors in database.
Answer» D. random errors in database.
118.

DSS stands for _______.

A. Deciding Support System.
B. Decision Support System.
C. Decision Software System.
D. Decision System of System.
Answer» B. Decision Support System.
119.

Data mining algorithms require ___________

A. efficient sampling method.
B. storage of intermediate results.
C. capacity to handle large amounts of data.
D. All of the above.
Answer» D. All of the above.
120.

The algorithm that need to access a table several times during execution is_______.

A. n-table scan algorithm.
B. zoom scan algorithm.
C. hybrid algorithm.
D. nearest neighbor search.
Answer» A. n-table scan algorithm.
121.

A coding operation in which an attribute with cardinality n is replaced by n binary attributes is called as ______.

A. falsification of table.
B. enrichment of table.
C. flattening of table.
D. fuzzification of table.
Answer» C. flattening of table.
122.

The un-normalized relation containing all attributes that exist in database is ______.

A. actual relation.
B. transparent relation.
C. verified relation.
D. universal relation.
Answer» D. universal relation.
123.

The technique of learning by generalizing from examples is ________.

A. incremental learning.
B. inductive learning.
C. hybrid learning.
D. generalized learning.
Answer» B. inductive learning.
124.

The ever increasing amount of data is compared to that of infinite library by Jorge Louis Borges in his short stories namely _________.

A. the library of Louis.
B. the library of Borges.
C. the library of Babel.
D. the library of Boulevard.
Answer» C. the library of Babel.
125.

______ itself has become a production factor of importance.

A. Data.
B. Information.
C. Program.
D. Algorithm.
Answer» B. Information.
126.

The _______ plays an important role in artificial intelligence.

A. programming skill.
B. scheduling.
C. planning.
D. learning capabilities.
Answer» D. learning capabilities.
127.

Knowledge discovery in database refers to _____.

A. whole process of extraction of knowledge from data.
B. selection of data.
C. coding.
D. cleaning the data.
Answer» A. whole process of extraction of knowledge from data.
128.

Data mining is used to refer ______ stage in knowledge discovery in database.

A. selection.
B. retrieving.
C. discovery.
D. coding.
Answer» C. discovery.
129.

Query tools and data mining tools are _______.

A. same.
B. different.
C. complementary.
D. standard.
Answer» C. complementary.
130.

In genetic algorithm the problem is considered in terms of _________.

A. values.
B. points in multidimensional space.
C. node.
D. strings of characters.
Answer» D. strings of characters.
131.

In UK,_______ has applied data mining techniques to analyze viewing figures. a. a press .

A. press
B. BBC
C. CNN
D. NDT
Answer» B. BBC
132.

In K- nearest neighbor the input is translated to __________.

A. values
B. points in multidimensional space
C. strings of characters
D. nodes
Answer» B. points in multidimensional space
133.

In machine learning ________ phase try to find the patterns from observations.

A. observation
B. theory
C. analysis
D. prediction
Answer» C. analysis
134.

__________________refers to the process of deriving high-quality information from text.

A. Text Mining.
B. Image Mining.
C. Database Mining.
D. Multimedia Mining.
Answer» A. Text Mining.
135.

The process of selecting good hypothesis and improving the theory based on this is called _______.

A. heuristic search
B. hill climbing algorithm.
C. incremental search.
D. apriori algorithm
Answer» B. hill climbing algorithm.
136.

_____________ is the application of data mining techniques to discover patterns from the Web.

A. Text Mining.
B. Multimedia Mining.
C. Web Mining.
D. Link Mining.
Answer» C. Web Mining.
137.

It is important to know the complexity of the _______ before developing any machine learning algorithm.

A. data
B. algorithm
C. search space
D. learning
Answer» C. search space
138.

Information content is closely related to ______ and transparency.

A. algorithm.
B. search space.
C. learning.
D. statistical significance.
Answer» D. statistical significance.
139.

The ________ is used to express the hypothesis describing the concept.

A. computer language.
B. algorithm.
C. definition.
D. theory
Answer» A. computer language.
140.

A definition of a concept is complete if it recognizes _________.

A. all the information.
B. all the instances of a concept.
C. only positive examples.
D. negative examples.
Answer» B. all the instances of a concept.
141.

The results of machine learning algorithms are always have to be checked for their _________.

A. observations.
B. calculations
C. programs.
D. statistical relevance.
Answer» D. statistical relevance.
142.

A ________ is necessary condition for KDDs effective implement.

A. data set.
B. database.
C. data warehouse.
D. data.
Answer» C. data warehouse.
143.

The first international KDD conference was held in the year ________.

A. 1995.
B. 1994.
C. 1993.
D. 1992.
Answer» A. 1995.
144.

AI stands for ____.

A. art of interest.
B. artificial interest.
C. art of intelligence.
D. artificial intelligence.
Answer» D. artificial intelligence.
145.

KDD is a ________.

A. new technology that is use to store data.
B. multidisciplinary field of research.
C. database technology.
D. expert system.
Answer» B. multidisciplinary field of research.
146.

______ could generate rule automatically.

A. KDD.
B. machine learning.
C. artificial intelligence.
D. expert system.
Answer» B. machine learning.
147.

Intelligent miner is a mining tool from _______.

A. Clementine.
B. living stones.
C. IBM.
D. Wipro.
Answer» C. IBM.
148.

The organization such as ______ is in USA.

A. AT & T.
B. AD & T.
C. AA & T.
D. AT & D.
Answer» A. AT & T.
149.

________ is a mining tool from integral solutions.

A. WEKA
B. web miner.
C. rapid miner.
D. clementine.
Answer» D. clementine.
150.

________ % of KDD is about preparing data.

A. 60.
B. 70
C. 80
D. 90
Answer» C. 80
151.

The ______ is one of the operation research techniques.

A. association rules.
B. k-nearest neighbor.
C. decision trees.
D. genetic algorithm.
Answer» B. k-nearest neighbor.
152.

Everything that science discovers has only ______ value.

A. standard.
B. different.
C. same.
D. temporary.
Answer» D. temporary.
153.

A good introduction to machine learning is the idea of ______.

A. concept learning.
B. content learning.
C. theory of falsification.
D. Poppers law.
Answer» A. concept learning.
154.

The algorithms that are controlled by human during their execution is _______ algorithm.

A. unsupervised.
B. supervised.
C. batch learning.
D. incremental.
Answer» B. supervised.
155.

Background knowledge depends on the form of ______________.

A. theoretical knowledge.
B. hypothesis.
C. formulae.
D. knowledge representation.
Answer» D. knowledge representation.
156.

Bias helps to ______.

A. learn.
B. complete the search.
C. execute the search.
D. constrain the search and utilizes KDD to analyze client files.
Answer» D. constrain the search and utilizes KDD to analyze client files.
157.

A _____ algorithm takes all the data at once and tries to create a hypothesis based on this data.

A. supervised.
B. batch learning.
C. unsupervised.
D. incremental learning.
Answer» B. batch learning.
158.

A ________ algorithm takes a new piece of information at each learning cycle and tries to revise the theory using new data.

A. supervised.
B. batch learning.
C. unsupervised.
D. incremental learning.
Answer» B. batch learning.
159.

The _________ forms the background knowledge in the inductive logic programming.

A. prolog program.
B. perl.
C. python.
D. ruby.
Answer» A. prolog program.
160.

In KDD process _______ % is about mining.

A. 40.
B. 30.
C. 20.
D. 10.
Answer» C. 20.
161.

________ is used to find the vaguely known data.

A. SQL.
B. KDD.
C. Data mining.
D. Sybase.
Answer» C. Data mining.
162.

A definition of a concept is _______ if it does not classify any negative examples as falling under the concept.

A. complete.
B. consistent.
C. good.
D. bad.
Answer» B. consistent.
163.

Lot of kangaroo jumping around the country side is an example for ________.

A. parallelism.
B. concept learning.
C. machine learning.
D. data mining.
Answer» A. parallelism.
164.

The easiest way to gain access to the data and facilitate effective decision making is to set up a _______.

A. database.
B. data mart.
C. data warehouse.
D. operational.
Answer» C. data warehouse.
165.

Smaller local data warehouse is called as ____.

A. data mart.
B. database.
C. data model.
D. meta data.
Answer» B. database.
166.

Data warehouse is only used for _____.

A. operating the data.
B. managing the data.
C. decision making.
D. queries.
Answer» D. queries.
167.

The _______ data are stored in data warehouse.

A. operational.
B. historical.
C. transactional.
D. optimized.
Answer» B. historical.
168.

A decision support system is a system that ________.

A. can constantly change over time.
B. cannot change.
C. copies the data.
D. supports the system.
Answer» A. can constantly change over time.
169.

Metadata is used by the end users for ______.

A. managing database.
B. structuring database.
C. querying purposes.
D. making decisions.
Answer» C. querying purposes.
170.

The _________ techniques are used to load information from operational database to data warehouse.

A. reengineering.
B. reverse.
C. transfer.
D. replication.
Answer» D. replication.
171.

The __________ represents the best choice for building a data warehouse.

A. client/server.
B. database.
C. bottom up.
D. visualization.
Answer» A. client/server.
172.

The __________ is one of database that operates on massively parallel computer.

A. sybase.
B. SQL.
C. postgre SQL.
D. tandem.
Answer» D. tandem.
173.

________ is more recent expert system.

A. Mycin.
B. Gasoil.
C. BMT.
D. XCONVAX.
Answer» B. Gasoil.
174.

A ______ is not the rule that govern the basic structure of data warehouse.

A. time dependent.
B. volatile.
C. subject oriented.
D. integrated.
Answer» B. volatile.
175.

The metadata that is generated at the time of building a warehouse is called ______.

A. Build time metadata.
B. Usage metadata.
C. Control metadata.
D. structure metadata.
Answer» A. Build time metadata.
176.

The control metadata is used to _______.

A. design a metadata.
B. administrate the metadata.
C. track the sequence and timing of warehouse events.
D. control the data.
Answer» C. track the sequence and timing of warehouse events.
177.

A data warehouse is said to contain a time-varying collection of data because ___.

A. its contents vary automatically with time.
B. its lifespan is very limited.
C. it contains historical data.
D. its content has explicit stamp.
Answer» C. it contains historical data.
178.

A data warehouse is an integrated collection of data because _____.

A. it is a collection of data of different data types.
B. it is a collection of data derived from multiple sources.
C. it is a relational database.
D. it contains summarized data.
Answer» B. it is a collection of data derived from multiple sources.
179.

Expert systems are ________.

A. system that contain the knowledge of specialists.
B. system that can think of their own.
C. system that can work.
D. system that can create the knowledge.
Answer» A. system that contain the knowledge of specialists.
180.

_______ is an expert who analyzed the effect of using machine learning algorithm in setting up expert system.

A. Borges.
B. Popper.
C. Bratko.
D. Papert.
Answer» C. Bratko.
181.

The element that is not taken into consideration for cost justification for the implementation of KDD environment is _______.

A. speed.
B. cost.
C. complexity.
D. repetition.
Answer» B. cost.
182.

A ______ is an interactive system that enables decision makers to use database and models on a computer in order to solve ill structured problems.

A. database.
B. client/server.
C. DSS.
D. mainframe
Answer» C. DSS.
183.

The _______ is a symbolic representation of facts or ideas from which information can potentially be extracted.

A. knowledge.
B. data.
C. algorithm.
D. program.
Answer» B. data.
184.

DB/2 is a family of RDBMS marketed by _____.

A. HCL.
B. Wipro.
C. IBM.
D. Infosys.
Answer» C. IBM.
185.

A collection of interesting and useful patterns in database is called _______.

A. knowledge.
B. information.
C. data.
D. algorithm.
Answer» A. knowledge.
186.

In data mining software that works on local workstation is used to _______.

A. write coding.
B. generate screen and reports for the end user.
C. make decisions.
D. find patterns.
Answer» B. generate screen and reports for the end user.
187.

A ________ acts a bridge between data warehouse and database application.

A. data mart.
B. operational data.
C. meta data.
D. data cube.
Answer» C. meta data.
188.

The _____ operation is used for reducing data cube by one or more dimensions.

A. drilling.
B. rolling.
C. dicing.
D. slicing.
Answer» D. slicing.
189.

The main organizational justification for implementing a data warehouse is to provide ______.

A. cheaper ways of handling transportation.
B. decision support.
C. storing large volume of data.
D. access to data.
Answer» C. storing large volume of data.
190.

KDD consists of ______ stages.

A. four.
B. five.
C. six.
D. seven.
Answer» C. six.
191.

_______ is the first stage in KDD process.

A. Data selection.
B. Cleaning.
C. Mining.
D. Enrichment.
Answer» A. Data selection.
192.

The term that is not associated with data cleaning process is ______.

A. domain consistance.
B. de-duplication.
C. disambiguation.
D. segmentation.
Answer» D. segmentation.
193.

In _______ process of KDD additional information can be added to the existing data.

A. enrichment.
B. coding.
C. selecting.
D. cleaning.
Answer» A. enrichment.
194.

_______ is a type of coding operation that occurs frequently in KDD context.

A. Filtering.
B. Visualization.
C. Flattening.
D. Replication.
Answer» C. Flattening.
195.

SQL stands for ________.

A. simple query language.
B. structured query language.
C. strong query language.
D. simple language.
Answer» B. structured query language.
196.

_________ is one of the traditional query tool.

A. MYSQL.
B. OLAP.
C. PL/SQL.
D. SQL.
Answer» D. SQL.
197.

The _____ is a useful method of discovering patterns at the beginning of data mining process.

A. calculating distance.
B. visualization techniques.
C. decision trees.
D. association rules.
Answer» B. visualization techniques.
198.

A/An_____ is an object oriented 3D tool kit which enables the user to explore 3D structure.

A. inventor.
B. tandim.
C. mantis.
D. extruder.
Answer» A. inventor.
199.

The field of research dedicated to the search for interesting projections of datasets are called __________.

A. projection pursuit.
B. research pursuit.
C. projection.
D. dataset pursuit.
Answer» A. projection pursuit.
200.

Which of the following is correct order of empirical cycle of scientific research?

A. Analysis, observation, prediction, theory.
B. Analysis, prediction, theory, observation.
C. Analysis, prediction, observation, theory.
D. Analysis, observation, theory, prediction.
Answer» B. Analysis, prediction, theory, observation.
Tags
Question and answers in Data Mining, Data Mining multiple choice questions and answers, Data Mining Important MCQs, Solved MCQs for Data Mining, Data Mining MCQs with answers PDF download