Quiznetik
Data Mining | Set 2
1. _________ is an example of case-based learning.
A. Decision trees.
B. Neural networks.
C. Genetic algorithm.
D. K-nearest neighbor.
Correct : D. K-nearest neighbor.
2. ___________ percentage of the interesting information can be obtained by using SQL.
A. 80
B. 70
C. 40
D. 50
Correct : A. 80
3. ________ is the technique used for discovering patterns in a dataset at the beginning of the data mining process.
A. Kohonen map.
B. Visualization.
C. OLAP.
D. SQL.
Correct : B. Visualization.
4. In the K-nearest neighbor algorithm, K stands for the ________.
A. number of neighbors that are investigated.
B. number of iterations.
C. number of total records.
D. random number.
Correct : A. number of neighbors that are investigated.
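As an illustration of what K means in questions 1 and 4, here is a minimal sketch of a K-nearest neighbor classifier. The function name `knn_predict` and the toy data are hypothetical, and majority voting over Euclidean distance is only one common choice.

```python
import math
from collections import Counter

def knn_predict(training, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    `training` is a list of (point, label) pairs; each point is a tuple of numbers.
    """
    # Sort the stored cases by their Euclidean distance to the query point.
    by_distance = sorted(training, key=lambda pair: math.dist(pair[0], query))
    # K is the number of neighbors that are investigated.
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

# Hypothetical toy data: two well-separated groups of points.
training = [
    ((1.0, 1.0), "low"), ((1.5, 2.0), "low"), ((2.0, 1.0), "low"),
    ((8.0, 8.0), "high"), ((9.0, 7.5), "high"), ((8.5, 9.0), "high"),
]

print(knn_predict(training, (2.0, 2.0), k=3))  # low
```

No model is built in advance; the stored cases themselves are consulted at prediction time, which is why the method counts as case-based learning.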
5. The complexity of data mining algorithm is represented by ________.
A. log n.
B. 2n log n.
C. n log n.
D. 2 log n.
Correct : C. n log n.
6. Genetic algorithm was proposed by _______.
A. John Holland.
B. Johnson.
C. Watson.
D. Kohonen.
Correct : A. John Holland.
7. ________ is the first stage in genetic algorithm.
A. Evaluation of each string.
B. Selection of string.
C. Creation of population of string.
D. Genetic manipulation.
Correct : C. Creation of population of string.
8. The _________ is the genetic operator used to recombine the genetic material of the population.
A. genetic operator.
B. mutation operator.
C. cross over operator.
D. encoding operator.
Correct : C. cross over operator.
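Recombination in a genetic algorithm is usually done by a crossover step, with mutation adding small random changes. The sketch below assumes candidates are encoded as equal-length bit strings; the function names `one_point_crossover` and `mutate` are hypothetical, and one-point crossover is just one of several variants.

```python
import random

def one_point_crossover(parent_a, parent_b, point):
    """Swap the tails of two equal-length strings at position `point`."""
    if len(parent_a) != len(parent_b):
        raise ValueError("parents must have the same length")
    child_a = parent_a[:point] + parent_b[point:]
    child_b = parent_b[:point] + parent_a[point:]
    return child_a, child_b

def mutate(bits, rate, rng):
    """Flip each bit of a '0'/'1' string independently with probability `rate`."""
    flipped = {"0": "1", "1": "0"}
    return "".join(flipped[c] if rng.random() < rate else c for c in bits)

print(one_point_crossover("11111111", "00000000", 3))  # ('11100000', '00011111')
print(mutate("11110000", 0.1, random.Random(0)))       # result depends on the seed
```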
9. _______ is the heart of knowledge discovery in database process.
A. Selection.
B. Data ware house.
C. Data mining.
D. Creative coding.
Correct : C. Data mining.
10. ______ is a planning optimization application written for KLM
A. PILOTS.
B. CAPTAINS.
C. CUSTOMERS.
D. AIRLINES.
Correct : B. CAPTAINS.
11. EIS stands for _________.
A. Executive Information System.
B. Exchange of Information System.
C. Extra Information System.
D. Extended Information system.
Correct : A. Executive Information System.
12. Foreign key constraints are also referred as _______.
A. consistency constraints.
B. referential integrity.
C. conditional integrity.
D. domain constraints.
Correct : B. referential integrity.
13. The set of attributes in a database table that refers to data in another table is called a ______.
A. primary key.
B. candidate key.
C. foreign key.
D. super key.
Correct : C. foreign key.
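Questions 12 and 13 can be illustrated with a small sketch using Python's built-in `sqlite3` module. The table names, column names, and the helper `insert_order` are hypothetical; the point is only that a foreign key rejects rows that refer to a non-existent parent row.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite only enforces foreign keys when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")
conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE orders ("
    " id INTEGER PRIMARY KEY,"
    " customer_id INTEGER NOT NULL REFERENCES customer(id))"  # foreign key
)
conn.execute("INSERT INTO customer VALUES (1, 'Ada')")

def insert_order(conn, order_id, customer_id):
    """Return True if the order was stored, False if referential integrity failed."""
    try:
        conn.execute("INSERT INTO orders VALUES (?, ?)", (order_id, customer_id))
        return True
    except sqlite3.IntegrityError:
        return False

print(insert_order(conn, 10, 1))   # True: customer 1 exists
print(insert_order(conn, 11, 99))  # False: no customer with id 99
```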
14. The distance between two points that is calculated using Pythagoras theorem is _________.
A. cartesian distance.
B. Euclidean distance.
C. extendable distance.
D. heuristic distance.
Correct : B. Euclidean distance.
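The distance in question 14 follows directly from the Pythagorean theorem: square the difference along each coordinate, add the squares, and take the square root. A minimal sketch (the function name `euclidean_distance` is chosen here for illustration):

```python
import math

def euclidean_distance(p, q):
    """Straight-line distance between two points given as equal-length sequences."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of coordinates")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

print(euclidean_distance((0, 0), (3, 4)))  # 5.0, the classic 3-4-5 right triangle
```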
15. A database containing volatile data used for daily operation of an organization is ______.
A. historic data.
B. metadata.
C. knowledge.
D. operational data.
Correct : D. operational data.
16. A system that can be used without knowledge of its internal operation is called a _______.
A. black box.
B. white box.
C. case based learning.
D. bias.
Correct : A. black box.
17. ______ is the relationship between compressibility and learnability.
A. Maximum description length principle.
B. Minimum description length principle.
C. Kolmogorov complexity.
D. Voronoi principle.
Correct : B. Minimum description length principle.
18. In KDD and data mining, noise is referred to as ________.
A. repeated data.
B. complex data.
C. meta data.
D. random errors in database.
Correct : D. random errors in database.
19. DSS stands for _______.
A. Deciding Support System.
B. Decision Support System.
C. Decision Software System.
D. Decision System of System.
Correct : B. Decision Support System.
20. Data mining algorithms require ___________
A. efficient sampling method.
B. storage of intermediate results.
C. capacity to handle large amounts of data.
D. All of the above.
Correct : D. All of the above.
21. An algorithm that needs to access a table several times during execution is called a _______.
A. n-table scan algorithm.
B. zoom scan algorithm.
C. hybrid algorithm.
D. nearest neighbor search.
Correct : A. n-table scan algorithm.
22. A coding operation in which an attribute with cardinality n is replaced by n binary attributes is called ______.
A. falsification of table.
B. enrichment of table.
C. flattening of table.
D. fuzzification of table.
Correct : C. flattening of table.
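The flattening operation from question 22 can be sketched in a few lines. The helper `flatten_attribute` and the sample records are hypothetical; the new column names of the form `attribute_value` are an assumption made for readability.

```python
def flatten_attribute(records, attribute):
    """Replace one attribute with cardinality n by n binary (0/1) attributes."""
    values = sorted({record[attribute] for record in records})
    flattened = []
    for record in records:
        # Copy every attribute except the one being flattened.
        row = {key: val for key, val in record.items() if key != attribute}
        # Add one binary attribute per distinct value.
        for value in values:
            row[f"{attribute}_{value}"] = 1 if record[attribute] == value else 0
        flattened.append(row)
    return flattened

records = [
    {"id": 1, "region": "north"},
    {"id": 2, "region": "south"},
    {"id": 3, "region": "north"},
]
for row in flatten_attribute(records, "region"):
    print(row)
```

Here `region` has cardinality 2, so it is replaced by the two binary attributes `region_north` and `region_south`.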
23. The un-normalized relation containing all attributes that exist in database is ______.
A. actual relation.
B. transparent relation.
C. verified relation.
D. universal relation.
Correct : D. universal relation.
24. The technique of learning by generalizing from examples is ________.
A. incremental learning.
B. inductive learning.
C. hybrid learning.
D. generalized learning.
Correct : B. inductive learning.
25. The ever-increasing amount of data is compared to the infinite library described by Jorge Luis Borges in his short story _________.
A. the library of Louis.
B. the library of Borges.
C. the library of Babel.
D. the library of Boulevard.
Correct : C. the library of Babel.
26. ______ itself has become a production factor of importance.
A. Data.
B. Information.
C. Program.
D. Algorithm.
Correct : B. Information.
27. The _______ plays an important role in artificial intelligence.
A. programming skill.
B. scheduling.
C. planning.
D. learning capabilities.
Correct : D. learning capabilities.
28. Knowledge discovery in database refers to _____.
A. whole process of extraction of knowledge from data.
B. selection of data.
C. coding.
D. cleaning the data.
Correct : A. whole process of extraction of knowledge from data.
29. Data mining refers to the ______ stage in knowledge discovery in databases.
A. selection.
B. retrieving.
C. discovery.
D. coding.
Correct : C. discovery.
30. Query tools and data mining tools are _______.
A. same.
B. different.
C. complementary.
D. standard.
Correct : C. complementary.
31. In genetic algorithm the problem is considered in terms of _________.
A. values.
B. points in multidimensional space.
C. node.
D. strings of characters.
Correct : D. strings of characters.
32. In the UK, _______ has applied data mining techniques to analyze viewing figures.
A. press
B. BBC
C. CNN
D. NDT
Correct : B. BBC
33. In K-nearest neighbor, the input is translated to __________.
A. values
B. points in multidimensional space
C. strings of characters
D. nodes
Correct : B. points in multidimensional space
34. In machine learning, the ________ phase tries to find patterns in the observations.
A. observation
B. theory
C. analysis
D. prediction
Correct : C. analysis
35. __________________ refers to the process of deriving high-quality information from text.
A. Text Mining.
B. Image Mining.
C. Database Mining.
D. Multimedia Mining.
Correct : A. Text Mining.
36. The process of selecting good hypothesis and improving the theory based on this is called _______.
A. heuristic search
B. hill climbing algorithm.
C. incremental search.
D. apriori algorithm
Correct : B. hill climbing algorithm.
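Hill climbing, as named in question 36, repeatedly moves to the best neighboring candidate and stops when no neighbor improves on the current one. The sketch below is a generic version under stated assumptions: `score` and `neighbors` are caller-supplied functions, and the integer toy problem is hypothetical.

```python
def hill_climb(score, neighbors, start, max_steps=1000):
    """Greedy local search: keep moving to the best-scoring neighbor."""
    current = start
    for _ in range(max_steps):
        best = max(neighbors(current), key=score, default=current)
        if score(best) <= score(current):
            return current  # no neighbor is better: a local optimum
        current = best
    return current

def score(x):
    """Toy objective with a single peak at x = 7."""
    return -(x - 7) ** 2

def neighbors(x):
    """Candidate hypotheses one step away from x."""
    return [x - 1, x + 1]

print(hill_climb(score, neighbors, start=0))  # 7
```

On objectives with several peaks this procedure can stop at a local optimum rather than the global one.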
37. _____________ is the application of data mining techniques to discover patterns from the Web.
A. Text Mining.
B. Multimedia Mining.
C. Web Mining.
D. Link Mining.
Correct : C. Web Mining.
38. It is important to know the complexity of the _______ before developing any machine learning algorithm.
A. data
B. algorithm
C. search space
D. learning
Correct : C. search space
39. Information content is closely related to ______ and transparency.
A. algorithm.
B. search space.
C. learning.
D. statistical significance.
Correct : D. statistical significance.
40. The ________ is used to express the hypothesis describing the concept.
A. computer language.
B. algorithm.
C. definition.
D. theory
Correct : A. computer language.
41. A definition of a concept is complete if it recognizes _________.
A. all the information.
B. all the instances of a concept.
C. only positive examples.
D. negative examples.
Correct : B. all the instances of a concept.
42. The results of machine learning algorithms always have to be checked for their _________.
A. observations.
B. calculations
C. programs.
D. statistical relevance.
Correct : D. statistical relevance.
43. A ________ is a necessary condition for the effective implementation of KDD.
A. data set.
B. database.
C. data warehouse.
D. data.
Correct : C. data warehouse.
44. The first international KDD conference was held in the year ________.
A. 1995.
B. 1994.
C. 1993.
D. 1992.
Correct : A. 1995.
45. AI stands for ____.
A. art of interest.
B. artificial interest.
C. art of intelligence.
D. artificial intelligence.
Correct : D. artificial intelligence.
46. KDD is a ________.
A. new technology that is used to store data.
B. multidisciplinary field of research.
C. database technology.
D. expert system.
Correct : B. multidisciplinary field of research.
47. ______ can generate rules automatically.
A. KDD.
B. machine learning.
C. artificial intelligence.
D. expert system.
Correct : B. machine learning.
48. Intelligent miner is a mining tool from _______.
A. Clementine.
B. living stones.
C. IBM.
D. Wipro.
Correct : C. IBM.
49. The organization such as ______ is in USA.
A. AT & T.
B. AD & T.
C. AA & T.
D. AT & D.
Correct : A. AT & T.
50. ________ is a mining tool from Integral Solutions.
A. WEKA
B. web miner.
C. rapid miner.
D. clementine.
Correct : D. clementine.
51. ________ % of KDD is about preparing data.
A. 60.
B. 70
C. 80
D. 90
Correct : C. 80
52. The ______ is one of the operations research techniques.
A. association rules.
B. k-nearest neighbor.
C. decision trees.
D. genetic algorithm.
Correct : B. k-nearest neighbor.
53. Everything that science discovers has only ______ value.
A. standard.
B. different.
C. same.
D. temporary.
Correct : D. temporary.
54. A good introduction to machine learning is the idea of ______.
A. concept learning.
B. content learning.
C. theory of falsification.
D. Popper's law.
Correct : A. concept learning.
55. Algorithms that are controlled by a human during their execution are called _______ algorithms.
A. unsupervised.
B. supervised.
C. batch learning.
D. incremental.
Correct : B. supervised.
56. Background knowledge depends on the form of ______________.
A. theoretical knowledge.
B. hypothesis.
C. formulae.
D. knowledge representation.
Correct : D. knowledge representation.
57. Bias helps to ______.
A. learn.
B. complete the search.
C. execute the search.
D. constrain the search.
Correct : D. constrain the search.
58. A _____ algorithm takes all the data at once and tries to create a hypothesis based on this data.
A. supervised.
B. batch learning.
C. unsupervised.
D. incremental learning.
Correct : B. batch learning.
59. A ________ algorithm takes a new piece of information at each learning cycle and tries to revise the theory using new data.
A. supervised.
B. batch learning.
C. unsupervised.
D. incremental learning.
Correct : D. incremental learning.
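Questions 58 and 59 contrast batch and incremental learning. The sketch below uses a deliberately simple "hypothesis", an estimate of the mean of the data, to show the difference in how data is consumed. The names `batch_mean` and `IncrementalMean` are hypothetical.

```python
def batch_mean(data):
    """Batch style: look at all the data at once and produce one estimate."""
    return sum(data) / len(data)

class IncrementalMean:
    """Incremental style: revise the estimate with each new observation."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0

    def update(self, x):
        # Move the current estimate toward the new observation.
        self.count += 1
        self.mean += (x - self.mean) / self.count
        return self.mean

data = [2.0, 4.0, 6.0, 8.0]
learner = IncrementalMean()
for x in data:
    learner.update(x)  # one learning cycle per new piece of information

print(batch_mean(data), learner.mean)  # 5.0 5.0
```

The incremental learner never needs the full dataset in memory, which matters when data arrives as a stream.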
60. The _________ forms the background knowledge in the inductive logic programming.
A. prolog program.
B. perl.
C. python.
D. ruby.
Correct : A. prolog program.
61. In KDD process _______ % is about mining.
A. 40.
B. 30.
C. 20.
D. 10.
Correct : C. 20.
62. ________ is used to find the vaguely known data.
A. SQL.
B. KDD.
C. Data mining.
D. Sybase.
Correct : C. Data mining.
63. A definition of a concept is _______ if it does not classify any negative examples as falling under the concept.
A. complete.
B. consistent.
C. good.
D. bad.
Correct : B. consistent.
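The notions of completeness (question 41) and consistency (question 63) can be checked mechanically when a concept definition is written as a predicate. In this sketch the function names and the "even number" examples are hypothetical.

```python
def is_complete(concept, positives):
    """Complete: the definition recognizes every instance of the concept."""
    return all(concept(x) for x in positives)

def is_consistent(concept, negatives):
    """Consistent: the definition classifies no negative example as an instance."""
    return not any(concept(x) for x in negatives)

# Target concept: even numbers.
positives = [2, 4, 6]
negatives = [1, 3, 5]

def too_general(x):
    return x > 1       # accepts 3 and 5 as well

def too_specific(x):
    return x == 2      # misses 4 and 6

def just_right(x):
    return x % 2 == 0

for concept in (too_general, too_specific, just_right):
    print(concept.__name__, is_complete(concept, positives), is_consistent(concept, negatives))
# too_general True False
# too_specific False True
# just_right True True
```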
64. A lot of kangaroos jumping around the countryside is an example of ________.
A. parallelism.
B. concept learning.
C. machine learning.
D. data mining.
Correct : A. parallelism.
65. The easiest way to gain access to the data and facilitate effective decision making is to set up a _______.
A. database.
B. data mart.
C. data warehouse.
D. operational.
Correct : C. data warehouse.
66. A smaller, local data warehouse is called a ____.
A. data mart.
B. database.
C. data model.
D. meta data.
Correct : A. data mart.
67. Data warehouse is only used for _____.
A. operating the data.
B. managing the data.
C. decision making.
D. queries.
Correct : D. queries.
68. The _______ data are stored in data warehouse.
A. operational.
B. historical.
C. transactional.
D. optimized.
Correct : B. historical.
69. A decision support system is a system that ________.
A. can constantly change over time.
B. cannot change.
C. copies the data.
D. supports the system.
Correct : A. can constantly change over time.
70. Metadata is used by the end users for ______.
A. managing database.
B. structuring database.
C. querying purposes.
D. making decisions.
Correct : C. querying purposes.
71. The _________ techniques are used to load information from operational database to data warehouse.
A. reengineering.
B. reverse.
C. transfer.
D. replication.
Correct : D. replication.
72. The __________ represents the best choice for building a data warehouse.
A. client/server.
B. database.
C. bottom up.
D. visualization.
Correct : A. client/server.
73. __________ is a database that operates on massively parallel computers.
A. sybase.
B. SQL.
C. postgre SQL.
D. tandem.
Correct : D. tandem.
74. ________ is a more recent expert system.
A. Mycin.
B. Gasoil.
C. BMT.
D. XCONVAX.
Correct : B. Gasoil.
75. ______ is not one of the rules that govern the basic structure of a data warehouse.
A. time dependent.
B. volatile.
C. subject oriented.
D. integrated.
Correct : B. volatile.
76. The metadata that is generated at the time of building a warehouse is called ______.
A. Build time metadata.
B. Usage metadata.
C. Control metadata.
D. structure metadata.
Correct : A. Build time metadata.
77. The control metadata is used to _______.
A. design a metadata.
B. administrate the metadata.
C. track the sequence and timing of warehouse events.
D. control the data.
Correct : C. track the sequence and timing of warehouse events.
78. A data warehouse is said to contain a time-varying collection of data because ___.
A. its contents vary automatically with time.
B. its lifespan is very limited.
C. it contains historical data.
D. its content has explicit stamp.
Correct : C. it contains historical data.
79. A data warehouse is an integrated collection of data because _____.
A. it is a collection of data of different data types.
B. it is a collection of data derived from multiple sources.
C. it is a relational database.
D. it contains summarized data.
Correct : B. it is a collection of data derived from multiple sources.
80. Expert systems are ________.
A. systems that contain the knowledge of specialists.
B. systems that can think on their own.
C. systems that can work.
D. systems that can create knowledge.
Correct : A. systems that contain the knowledge of specialists.
81. _______ is an expert who analyzed the effect of using machine learning algorithm in setting up expert system.
A. Borges.
B. Popper.
C. Bratko.
D. Papert.
Correct : C. Bratko.
82. The element that is not taken into consideration for cost justification for the implementation of KDD environment is _______.
A. speed.
B. cost.
C. complexity.
D. repetition.
Correct : B. cost.
83. A ______ is an interactive system that enables decision makers to use database and models on a computer in order to solve ill structured problems.
A. database.
B. client/server.
C. DSS.
D. mainframe
Correct : C. DSS.
84. The _______ is a symbolic representation of facts or ideas from which information can potentially be extracted.
A. knowledge.
B. data.
C. algorithm.
D. program.
Correct : B. data.
85. DB/2 is a family of RDBMS marketed by _____.
A. HCL.
B. Wipro.
C. IBM.
D. Infosys.
Correct : C. IBM.
86. A collection of interesting and useful patterns in database is called _______.
A. knowledge.
B. information.
C. data.
D. algorithm.
Correct : A. knowledge.
87. In data mining, software that runs on a local workstation is used to _______.
A. write coding.
B. generate screen and reports for the end user.
C. make decisions.
D. find patterns.
Correct : B. generate screen and reports for the end user.
88. ________ acts as a bridge between the data warehouse and database applications.
A. data mart.
B. operational data.
C. meta data.
D. data cube.
Correct : C. meta data.
89. The _____ operation is used for reducing data cube by one or more dimensions.
A. drilling.
B. rolling.
C. dicing.
D. slicing.
Correct : D. slicing.
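A slice fixes one dimension of a data cube to a single value, leaving a cube with one dimension fewer. The sketch below represents a cube as a dictionary keyed by coordinate tuples; this representation, the helper `slice_cube`, and the sales figures are all hypothetical.

```python
def slice_cube(cube, dims, dim, value):
    """Fix dimension `dim` to `value` and drop it from the cell coordinates."""
    i = dims.index(dim)
    return {
        key[:i] + key[i + 1:]: measure
        for key, measure in cube.items()
        if key[i] == value
    }

dims = ("region", "product", "year")
cube = {
    ("north", "tea", 2023): 10,
    ("north", "coffee", 2023): 7,
    ("south", "tea", 2023): 4,
    ("north", "tea", 2024): 12,
}

# Slicing on year = 2023 turns the 3-dimensional cube into a 2-dimensional one.
print(slice_cube(cube, dims, "year", 2023))
# {('north', 'tea'): 10, ('north', 'coffee'): 7, ('south', 'tea'): 4}
```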
90. The main organizational justification for implementing a data warehouse is to provide ______.
A. cheaper ways of handling transportation.
B. decision support.
C. storing large volume of data.
D. access to data.
Correct : B. decision support.
91. KDD consists of ______ stages.
A. four.
B. five.
C. six.
D. seven.
Correct : C. six.
92. _______ is the first stage in KDD process.
A. Data selection.
B. Cleaning.
C. Mining.
D. Enrichment.
Correct : A. Data selection.
93. The term that is not associated with data cleaning process is ______.
A. domain consistency.
B. de-duplication.
C. disambiguation.
D. segmentation.
Correct : D. segmentation.
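Of the cleaning operations listed in question 93, de-duplication is the easiest to sketch. The helper `deduplicate` below is hypothetical, and treating records as duplicates when their key fields match after trimming whitespace and ignoring case is an assumption; real cleaning rules depend on the data.

```python
def deduplicate(records, key_fields):
    """Keep the first record for each normalized combination of key fields."""
    seen = set()
    unique = []
    for record in records:
        # Normalize so that 'J. Smith ' and 'j. smith' count as the same value.
        key = tuple(str(record[field]).strip().lower() for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique

clients = [
    {"name": "J. Smith", "city": "Leeds"},
    {"name": "j. smith ", "city": "leeds"},
    {"name": "A. Jones", "city": "York"},
]
print(len(deduplicate(clients, ["name", "city"])))  # 2
```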
94. In _______ process of KDD additional information can be added to the existing data.
A. enrichment.
B. coding.
C. selecting.
D. cleaning.
Correct : A. enrichment.
95. _______ is a type of coding operation that occurs frequently in KDD context.
A. Filtering.
B. Visualization.
C. Flattening.
D. Replication.
Correct : C. Flattening.
96. SQL stands for ________.
A. simple query language.
B. structured query language.
C. strong query language.
D. simple language.
Correct : B. structured query language.
97. _________ is one of the traditional query tools.
A. MYSQL.
B. OLAP.
C. PL/SQL.
D. SQL.
Correct : D. SQL.
98. The _____ is a useful method of discovering patterns at the beginning of data mining process.
A. calculating distance.
B. visualization techniques.
C. decision trees.
D. association rules.
Correct : B. visualization techniques.
99. A/An _____ is an object-oriented 3D toolkit which enables the user to explore 3D structures.
A. inventor.
B. tandim.
C. mantis.
D. extruder.
Correct : A. inventor.
100. The field of research dedicated to the search for interesting projections of datasets is called __________.
A. projection pursuit.
B. research pursuit.
C. projection.
D. dataset pursuit.
Correct : A. projection pursuit.