Quiznetik

Data Mining | Set 2

1. _________ is an example of case-based learning.

A. Decision trees.

B. Neural networks.

C. Genetic algorithm.

D. K-nearest neighbor.

Correct : D. K-nearest neighbor.

2. ___________ percent of the interesting information can be obtained by using SQL.

A. 80

B. 70

C. 40

D. 50

Correct : A. 80

3. ________ is the technique used for discovering patterns in a dataset at the beginning of the data mining process.

A. Kohonen map.

B. Visualization.

C. OLAP.

D. SQL.

Correct : B. Visualization.

4. In the K-nearest neighbor algorithm, K stands for ________.

A. number of neighbors that are investigated.

B. number of iterations.

C. number of total records.

D. random number.

Correct : A. number of neighbors that are investigated.
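
To make the role of K concrete, here is a minimal sketch of a K-nearest neighbor classifier. The function name `knn_predict` and the toy data are assumptions made for illustration only; the sketch uses straight-line distance and a simple majority vote among the K neighbors that are investigated.

```python
import math
from collections import Counter

def knn_predict(train, query, k):
    """Classify `query` by majority vote among its k nearest training points.

    `train` is a list of (point, label) pairs, where each point is a tuple
    of numbers with the same length as `query`.
    """
    # Sort the training records by their distance to the query point.
    by_distance = sorted(train, key=lambda item: math.dist(item[0], query))
    # K is the number of neighbors that are investigated.
    nearest_labels = [label for _, label in by_distance[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

# Hypothetical toy data: two clusters labelled "low" and "high".
train = [
    ((1.0, 1.0), "low"),
    ((1.5, 2.0), "low"),
    ((2.0, 1.0), "low"),
    ((8.0, 8.0), "high"),
    ((9.0, 7.5), "high"),
]
print(knn_predict(train, (1.2, 1.4), k=3))
```

A larger K smooths the decision at the cost of examining more records for every query.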

5. The complexity of data mining algorithm is represented by ________.

A. log n.

B. 2n log n.

C. n log n.

D. 2 log n.

Correct : C. n log n.

6. Genetic algorithm was proposed by _______.

A. John Holland.

B. Johnson.

C. Watson.

D. Kohonen.

Correct : A. John Holland.

7. ________ is the first stage in genetic algorithm.

A. Evaluation of each string.

B. Selection of string.

C. Creation of population of string.

D. Genetic manipulation.

Correct : C. Creation of population of string.

8. The _________ is one of the genetic operators used to recombine the population of genetic material.

A. genetic operator.

B. mutation operator.

C. cross over operator.

D. encoding operator.

Correct : C. cross over operator.
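
The stages listed in question 7 (creating a population of strings, evaluating each string, selecting strings, and genetic manipulation) can be sketched as a small loop. This is only an illustrative toy, not a reference implementation: the fitness function (count of `1` characters), the parameter values, and all function names are assumptions. It uses one-point crossover as the recombination step and keeps the best string of each generation unchanged.

```python
import random

def fitness(s):
    # Evaluation: a toy fitness that counts the '1' characters in the string.
    return s.count("1")

def crossover(a, b, rng):
    # Recombination: one-point crossover of two parent strings.
    point = rng.randrange(1, len(a))
    return a[:point] + b[point:]

def mutate(s, rate, rng):
    # Mutation: flip each character independently with probability `rate`.
    return "".join(
        ("1" if c == "0" else "0") if rng.random() < rate else c for c in s
    )

def run_ga(length=16, pop_size=20, generations=30, rate=0.02, seed=0):
    rng = random.Random(seed)
    # Stage 1: creation of a population of strings.
    population = [
        "".join(rng.choice("01") for _ in range(length)) for _ in range(pop_size)
    ]
    history = []
    for _ in range(generations):
        # Stage 2: evaluation of each string.
        ranked = sorted(population, key=fitness, reverse=True)
        history.append(fitness(ranked[0]))
        # Stage 3: selection of the fitter half as parents.
        parents = ranked[: pop_size // 2]
        # Stage 4: genetic manipulation; the best string is carried over as is.
        children = [ranked[0]]
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            children.append(mutate(crossover(a, b, rng), rate, rng))
        population = children
    return max(population, key=fitness), history

best, history = run_ga()
print(best, fitness(best))
```

Because the best string survives each generation, the recorded best fitness never decreases from one generation to the next.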

9. _______ is the heart of knowledge discovery in database process.

A. Selection.

B. Data ware house.

C. Data mining.

D. Creative coding.

Correct : C. Data mining.

10. ______ is a planning optimization application written for KLM.

A. PILOTS.

B. CAPTAINS.

C. CUSTOMERS.

D. AIRLINES.

Correct : B. CAPTAINS.

11. EIS stands for _________.

A. Executive Information System.

B. Exchange of Information System.

C. Extra Information System.

D. Extended Information System.

Correct : A. Executive Information System.

12. Foreign key constraints are also referred to as _______.

A. consistency constraints.

B. referential integrity.

C. conditional integrity.

D. domain constraints.

Correct : B. referential integrity.

13. The set of attributes in a database that refers to data in another table is called a ______.

A. primary key.

B. candidate key.

C. foreign key.

D. super key.

Correct : C. foreign key.
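
As a hedged illustration of questions 12 and 13, the sketch below uses Python's built-in `sqlite3` module to show a foreign key enforcing referential integrity. The table and column names (`customer`, `orders`, `customer_id`) and the helper functions are hypothetical. Note that SQLite only enforces foreign keys once `PRAGMA foreign_keys = ON` has been issued on the connection.

```python
import sqlite3

def make_db():
    """Create an in-memory database with a parent table and a child table."""
    conn = sqlite3.connect(":memory:")
    # SQLite does not enforce foreign keys unless this pragma is enabled.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT)")
    conn.execute(
        "CREATE TABLE orders ("
        " id INTEGER PRIMARY KEY,"
        " customer_id INTEGER NOT NULL REFERENCES customer(id))"
    )
    conn.execute("INSERT INTO customer (id, name) VALUES (1, 'Ada')")
    return conn

def try_insert_order(conn, customer_id):
    """Return True if the order row is accepted, False if the key check fails."""
    try:
        conn.execute("INSERT INTO orders (customer_id) VALUES (?)", (customer_id,))
        return True
    except sqlite3.IntegrityError:
        # The foreign key points at a customer row that does not exist.
        return False

conn = make_db()
print(try_insert_order(conn, 1))   # refers to an existing customer
print(try_insert_order(conn, 99))  # refers to a missing customer
```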

14. The distance between two points that is calculated using Pythagoras theorem is _________.

A. cartesian distance.

B. Euclidean distance.

C. extendable distance.

D. heuristic distance.

Correct : B. Euclidean distance.
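
For reference, the distance in question 14 is the square root of the sum of squared coordinate differences. A minimal sketch follows; the function name `euclidean_distance` is chosen here for illustration.

```python
import math

def euclidean_distance(p, q):
    """Straight-line distance between two points with the same number of coordinates."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of dimensions")
    # Pythagoras generalized to n dimensions: sqrt of the summed squared differences.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

print(euclidean_distance((0, 0), (3, 4)))  # 5.0, the classic 3-4-5 triangle
```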

15. A database containing volatile data used for daily operation of an organization is ______.

A. historic data.

B. metadata.

C. knowledge.

D. operational data.

Correct : D. operational data.

16. A system that can be used without knowledge of its internal operation is called a _______.

A. black box.

B. white box.

C. case based learning.

D. bias.

Correct : A. black box.

17. ______ is the relationship between compressibility and learnability.

A. Maximum description length principle.

B. Minimum description length principle.

C. Kolmogorov complexity.

D. Voronoi principle.

Correct : B. Minimum description length principle.

18. In KDD and data mining, noise is referred to as ________.

A. repeated data.

B. complex data.

C. meta data.

D. random errors in database.

Correct : D. random errors in database.

19. DSS stands for _______.

A. Deciding Support System.

B. Decision Support System.

C. Decision Software System.

D. Decision System of System.

Correct : B. Decision Support System.

20. Data mining algorithms require ___________

A. efficient sampling method.

B. storage of intermediate results.

C. capacity to handle large amounts of data.

D. All of the above.

Correct : D. All of the above.

21. An algorithm that needs to access a table several times during execution is called a(n) _______.

A. n-table scan algorithm.

B. zoom scan algorithm.

C. hybrid algorithm.

D. nearest neighbor search.

Correct : A. n-table scan algorithm.

22. A coding operation in which an attribute with cardinality n is replaced by n binary attributes is called ______.

A. falsification of table.

B. enrichment of table.

C. flattening of table.

D. fuzzification of table.

Correct : C. flattening of table.
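
The coding operation described in question 22 can be sketched in a few lines. This is an illustrative example under assumptions: the records are plain dictionaries, and the attribute name `magazine` and the helper `flatten_attribute` are made up for the demonstration.

```python
def flatten_attribute(records, attribute):
    """Replace one attribute with n distinct values by n binary attributes."""
    # The cardinality n is the number of distinct values the attribute takes.
    values = sorted({record[attribute] for record in records})
    flattened = []
    for record in records:
        # Copy every other attribute unchanged.
        row = {key: val for key, val in record.items() if key != attribute}
        # Add one 0/1 column per distinct value.
        for value in values:
            row[f"{attribute}={value}"] = 1 if record[attribute] == value else 0
        flattened.append(row)
    return flattened

records = [
    {"id": 1, "magazine": "car"},
    {"id": 2, "magazine": "sports"},
    {"id": 3, "magazine": "car"},
]
for row in flatten_attribute(records, "magazine"):
    print(row)
```

Each output row carries exactly one `1` among the new binary columns, which makes the attribute usable by algorithms that expect numeric input.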

23. The un-normalized relation containing all attributes that exist in database is ______.

A. actual relation.

B. transparent relation.

C. verified relation.

D. universal relation.

Correct : D. universal relation.

24. The technique of learning by generalizing from examples is ________.

A. incremental learning.

B. inductive learning.

C. hybrid learning.

D. generalized learning.

Correct : B. inductive learning.

25. The ever-increasing amount of data is compared to an infinite library by Jorge Luis Borges in his short story _________.

A. the library of Louis.

B. the library of Borges.

C. the library of Babel.

D. the library of Boulevard.

Correct : C. the library of Babel.

26. ______ itself has become a production factor of importance.

A. Data.

B. Information.

C. Program.

D. Algorithm.

Correct : B. Information.

27. The _______ plays an important role in artificial intelligence.

A. programming skill.

B. scheduling.

C. planning.

D. learning capabilities.

Correct : D. learning capabilities.

28. Knowledge discovery in database refers to _____.

A. whole process of extraction of knowledge from data.

B. selection of data.

C. coding.

D. cleaning the data.

Correct : A. whole process of extraction of knowledge from data.

29. Data mining is used to refer to the ______ stage in knowledge discovery in database.

A. selection.

B. retrieving.

C. discovery.

D. coding.

Correct : C. discovery.

30. Query tools and data mining tools are _______.

A. same.

B. different.

C. complementary.

D. standard.

Correct : C. complementary.

31. In genetic algorithm the problem is considered in terms of _________.

A. values.

B. points in multidimensional space.

C. node.

D. strings of characters.

Correct : D. strings of characters.

32. In the UK, _______ has applied data mining techniques to analyze viewing figures.

A. press

B. BBC

C. CNN

D. NDT

Correct : B. BBC

33. In K-nearest neighbor, the input is translated to __________.

A. values

B. points in multidimensional space

C. strings of characters

D. nodes

Correct : B. points in multidimensional space

34. In machine learning, the ________ phase tries to find the patterns from observations.

A. observation

B. theory

C. analysis

D. prediction

Correct : C. analysis

35. __________________refers to the process of deriving high-quality information from text.

A. Text Mining.

B. Image Mining.

C. Database Mining.

D. Multimedia Mining.

Correct : A. Text Mining.

36. The process of selecting a good hypothesis and improving the theory based on this is called _______.

A. heuristic search

B. hill climbing algorithm.

C. incremental search.

D. apriori algorithm

Correct : B. hill climbing algorithm.
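
The idea in question 36, picking a good candidate and improving on it step by step, can be illustrated with a minimal hill-climbing sketch. The scoring function and neighbor function below are toy assumptions; in a learning setting the candidates would be hypotheses rather than integers.

```python
def hill_climb(score, start, neighbors, max_steps=1000):
    """Move to the best-scoring neighbor until no neighbor improves the score."""
    current = start
    for _ in range(max_steps):
        best = max(neighbors(current), key=score)
        if score(best) <= score(current):
            # No neighbor is better: a (possibly only local) optimum is reached.
            return current
        current = best
    return current

# Toy problem: the score peaks at x = 7, and each step moves one unit left or right.
score = lambda x: -(x - 7) ** 2
neighbors = lambda x: [x - 1, x + 1]
print(hill_climb(score, start=0, neighbors=neighbors))  # 7
```

On a score with several peaks this procedure can stop at a local optimum, which is the usual caveat of hill climbing.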

37. _____________ is the application of data mining techniques to discover patterns from the Web.

A. Text Mining.

B. Multimedia Mining.

C. Web Mining.

D. Link Mining.

Correct : C. Web Mining.

38. It is important to know the complexity of the _______ before developing any machine learning algorithm.

A. data

B. algorithm

C. search space

D. learning

Correct : C. search space

39. Information content is closely related to ______ and transparency.

A. algorithm.

B. search space.

C. learning.

D. statistical significance.

Correct : D. statistical significance.

40. The ________ is used to express the hypothesis describing the concept.

A. computer language.

B. algorithm.

C. definition.

D. theory

Correct : A. computer language.

41. A definition of a concept is complete if it recognizes _________.

A. all the information.

B. all the instances of a concept.

C. only positive examples.

D. negative examples.

Correct : B. all the instances of a concept.

42. The results of machine learning algorithms always have to be checked for their _________.

A. observations.

B. calculations

C. programs.

D. statistical relevance.

Correct : D. statistical relevance.

43. A ________ is a necessary condition for the effective implementation of KDD.

A. data set.

B. database.

C. data warehouse.

D. data.

Correct : C. data warehouse.

44. The first international KDD conference was held in the year ________.

A. 1995.

B. 1994.

C. 1993.

D. 1992.

Correct : A. 1995.

45. AI stands for ____.

A. art of interest.

B. artificial interest.

C. art of intelligence.

D. artificial intelligence.

Correct : D. artificial intelligence.

46. KDD is a ________.

A. new technology that is used to store data.

B. multidisciplinary field of research.

C. database technology.

D. expert system.

Correct : B. multidisciplinary field of research.

47. ______ can generate rules automatically.

A. KDD.

B. machine learning.

C. artificial intelligence.

D. expert system.

Correct : B. machine learning.

48. Intelligent miner is a mining tool from _______.

A. Clementine.

B. living stones.

C. IBM.

D. Wipro.

Correct : C. IBM.

49. The organization such as ______ is in USA.

A. AT & T.

B. AD & T.

C. AA & T.

D. AT & D.

Correct : A. AT & T.

50. ________ is a mining tool from Integral Solutions.

A. WEKA

B. web miner.

C. rapid miner.

D. clementine.

Correct : D. clementine.

51. ________ % of KDD is about preparing data.

A. 60.

B. 70

C. 80

D. 90

Correct : C. 80

52. The ______ is one of the operations research techniques.

A. association rules.

B. k-nearest neighbor.

C. decision trees.

D. genetic algorithm.

Correct : D. genetic algorithm.

53. Everything that science discovers has only ______ value.

A. standard.

B. different.

C. same.

D. temporary.

Correct : D. temporary.

54. A good introduction to machine learning is the idea of ______.

A. concept learning.

B. content learning.

C. theory of falsification.

D. Popper's law.

Correct : A. concept learning.

55. An algorithm that is controlled by a human during its execution is a(n) _______ algorithm.

A. unsupervised.

B. supervised.

C. batch learning.

D. incremental.

Correct : B. supervised.

56. Background knowledge depends on the form of ______________.

A. theoretical knowledge.

B. hypothesis.

C. formulae.

D. knowledge representation.

Correct : D. knowledge representation.

57. Bias helps to ______.

A. learn.

B. complete the search.

C. execute the search.

D. constrain the search.

Correct : D. constrain the search.

58. A _____ algorithm takes all the data at once and tries to create a hypothesis based on this data.

A. supervised.

B. batch learning.

C. unsupervised.

D. incremental learning.

Correct : B. batch learning.

59. A ________ algorithm takes a new piece of information at each learning cycle and tries to revise the theory using new data.

A. supervised.

B. batch learning.

C. unsupervised.

D. incremental learning.

Correct : D. incremental learning.
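
The contrast drawn in questions 58 and 59, between a learner that takes all the data at once and one that revises its theory with each new piece of information, can be sketched with a deliberately simple "theory": an estimate of the mean. The names `batch_mean` and `IncrementalMean` are illustrative assumptions, not part of any library.

```python
def batch_mean(data):
    """Batch style: look at all the data at once and produce one estimate."""
    return sum(data) / len(data)

class IncrementalMean:
    """Incremental style: revise the current estimate with each new observation."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0

    def update(self, x):
        # Shift the old estimate toward the new value by 1/count of the gap.
        self.count += 1
        self.mean += (x - self.mean) / self.count
        return self.mean

data = [2.0, 4.0, 6.0, 8.0]
learner = IncrementalMean()
for x in data:
    print(learner.update(x))   # estimate after each learning cycle
print(batch_mean(data))        # estimate from all data at once
```

Both approaches arrive at the same estimate here; the difference is that the incremental learner never needs the full data set in memory.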

60. The _________ forms the background knowledge in the inductive logic programming.

A. prolog program.

B. perl.

C. python.

D. ruby.

Correct : A. prolog program.

61. In KDD process _______ % is about mining.

A. 40.

B. 30.

C. 20.

D. 10.

Correct : C. 20.

62. ________ is used to find the vaguely known data.

A. SQL.

B. KDD.

C. Data mining.

D. Sybase.

Correct : C. Data mining.

63. A definition of a concept is _______ if it does not classify any negative examples as falling under the concept.

A. complete.

B. consistent.

C. good.

D. bad.

Correct : B. consistent.
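
Questions 41 and 63 describe two properties of a concept definition. As a small sketch, treating a definition as a function that returns True when an example falls under the concept (an assumption made for illustration, as are the function names), the two checks look like this:

```python
def is_complete(concept, positives):
    """Complete: the definition recognizes every instance of the concept."""
    return all(concept(x) for x in positives)

def is_consistent(concept, negatives):
    """Consistent: the definition classifies no negative example as positive."""
    return not any(concept(x) for x in negatives)

# Toy concept "even number" with hand-picked examples.
positives = [2, 4, 6]
negatives = [1, 3, 5]

is_even = lambda x: x % 2 == 0
greater_than_three = lambda x: x > 3

print(is_complete(is_even, positives), is_consistent(is_even, negatives))
print(is_complete(greater_than_three, positives),
      is_consistent(greater_than_three, negatives))
```

The second definition fails both checks: it misses the positive example 2 and accepts the negative example 5.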

64. A lot of kangaroos jumping around the countryside is an example of ________.

A. parallelism.

B. concept learning.

C. machine learning.

D. data mining.

Correct : A. parallelism.

65. The easiest way to gain access to the data and facilitate effective decision making is to set up a _______.

A. database.

B. data mart.

C. data warehouse.

D. operational.

Correct : C. data warehouse.

66. A smaller, local data warehouse is called a ____.

A. data mart.

B. database.

C. data model.

D. meta data.

Correct : A. data mart.

67. Data warehouse is only used for _____.

A. operating the data.

B. managing the data.

C. decision making.

D. queries.

Correct : D. queries.

68. The _______ data are stored in data warehouse.

A. operational.

B. historical.

C. transactional.

D. optimized.

Correct : B. historical.

69. A decision support system is a system that ________.

A. can constantly change over time.

B. cannot change.

C. copies the data.

D. supports the system.

Correct : A. can constantly change over time.

70. Metadata is used by the end users for ______.

A. managing database.

B. structuring database.

C. querying purposes.

D. making decisions.

Correct : C. querying purposes.

71. The _________ techniques are used to load information from operational database to data warehouse.

A. reengineering.

B. reverse.

C. transfer.

D. replication.

Correct : D. replication.

72. The __________ represents the best choice for building a data warehouse.

A. client/server.

B. database.

C. bottom up.

D. visualization.

Correct : A. client/server.

73. __________ is a database that operates on massively parallel computers.

A. sybase.

B. SQL.

C. PostgreSQL.

D. tandem.

Correct : D. tandem.

74. ________ is a more recent expert system.

A. Mycin.

B. Gasoil.

C. BMT.

D. XCONVAX.

Correct : B. Gasoil.

75. ______ is not one of the rules that govern the basic structure of a data warehouse.

A. time dependent.

B. volatile.

C. subject oriented.

D. integrated.

Correct : B. volatile.

76. The metadata that is generated at the time of building a warehouse is called ______.

A. Build time metadata.

B. Usage metadata.

C. Control metadata.

D. structure metadata.

Correct : A. Build time metadata.

77. The control metadata is used to _______.

A. design a metadata.

B. administrate the metadata.

C. track the sequence and timing of warehouse events.

D. control the data.

Correct : C. track the sequence and timing of warehouse events.

78. A data warehouse is said to contain a time-varying collection of data because ___.

A. its contents vary automatically with time.

B. its lifespan is very limited.

C. it contains historical data.

D. its content has explicit stamp.

Correct : C. it contains historical data.

79. A data warehouse is an integrated collection of data because _____.

A. it is a collection of data of different data types.

B. it is a collection of data derived from multiple sources.

C. it is a relational database.

D. it contains summarized data.

Correct : B. it is a collection of data derived from multiple sources.

80. Expert systems are ________.

A. systems that contain the knowledge of specialists.

B. systems that can think on their own.

C. systems that can work.

D. systems that can create knowledge.

Correct : A. systems that contain the knowledge of specialists.

81. _______ is an expert who analyzed the effect of using machine learning algorithm in setting up expert system.

A. Borges.

B. Popper.

C. Bratko.

D. Papert.

Correct : C. Bratko.

82. The element that is not taken into consideration for cost justification for the implementation of KDD environment is _______.

A. speed.

B. cost.

C. complexity.

D. repetition.

Correct : B. cost.

83. A ______ is an interactive system that enables decision makers to use databases and models on a computer in order to solve ill-structured problems.

A. database.

B. client/server.

C. DSS.

D. mainframe

Correct : C. DSS.

84. The _______ is a symbolic representation of facts or ideas from which information can potentially be extracted.

A. knowledge.

B. data.

C. algorithm.

D. program.

Correct : B. data.

85. DB/2 is a family of RDBMS marketed by _____.

A. HCL.

B. Wipro.

C. IBM.

D. Infosys.

Correct : C. IBM.

86. A collection of interesting and useful patterns in database is called _______.

A. knowledge.

B. information.

C. data.

D. algorithm.

Correct : A. knowledge.

87. In data mining, software that runs on the local workstation is used to _______.

A. write coding.

B. generate screen and reports for the end user.

C. make decisions.

D. find patterns.

Correct : B. generate screen and reports for the end user.

88. ________ acts as a bridge between the data warehouse and database applications.

A. data mart.

B. operational data.

C. meta data.

D. data cube.

Correct : C. meta data.

89. The _____ operation is used for reducing data cube by one or more dimensions.

A. drilling.

B. rolling.

C. dicing.

D. slicing.

Correct : D. slicing.
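
As a hedged sketch of the operation in question 89, the example below models a data cube as a dictionary keyed by coordinate tuples and fixes one dimension to a single value, which leaves a result with one dimension fewer. The cube contents, the dimension order (product, region, year), and the helper `slice_cube` are all assumptions made for illustration.

```python
def slice_cube(cube, axis, value):
    """Fix dimension `axis` to `value` and drop it from the remaining keys."""
    return {
        key[:axis] + key[axis + 1:]: measure
        for key, measure in cube.items()
        if key[axis] == value
    }

# Hypothetical 3-D cube: (product, region, year) -> units sold.
cube = {
    ("tv", "north", 2023): 10,
    ("tv", "south", 2023): 7,
    ("radio", "north", 2024): 3,
}

# Slice on the year dimension (axis 2): the result is keyed by (product, region).
print(slice_cube(cube, axis=2, value=2023))
```

Applying the function again to the result fixes a second dimension, reducing the cube further.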

90. The main organizational justification for implementing a data warehouse is to provide ______.

A. cheaper ways of handling transportation.

B. decision support.

C. storing large volume of data.

D. access to data.

Correct : B. decision support.

91. KDD consists of ______ stages.

A. four.

B. five.

C. six.

D. seven.

Correct : C. six.

92. _______ is the first stage in KDD process.

A. Data selection.

B. Cleaning.

C. Mining.

D. Enrichment.

Correct : A. Data selection.

93. The term that is not associated with data cleaning process is ______.

A. domain consistency.

B. de-duplication.

C. disambiguation.

D. segmentation.

Correct : D. segmentation.

94. In _______ process of KDD additional information can be added to the existing data.

A. enrichment.

B. coding.

C. selecting.

D. cleaning.

Correct : A. enrichment.

95. _______ is a type of coding operation that occurs frequently in KDD context.

A. Filtering.

B. Visualization.

C. Flattening.

D. Replication.

Correct : C. Flattening.

96. SQL stands for ________.

A. simple query language.

B. structured query language.

C. strong query language.

D. simple language.

Correct : B. structured query language.

97. _________ is one of the traditional query tools.

A. MYSQL.

B. OLAP.

C. PL/SQL.

D. SQL.

Correct : D. SQL.

98. The _____ is a useful method of discovering patterns at the beginning of data mining process.

A. calculating distance.

B. visualization techniques.

C. decision trees.

D. association rules.

Correct : B. visualization techniques.

99. A/An _____ is an object-oriented 3D toolkit that enables the user to explore 3D structures.

A. inventor.

B. tandim.

C. mantis.

D. extruder.

Correct : A. inventor.

100. The field of research dedicated to the search for interesting projections of datasets is called __________.

A. projection pursuit.

B. research pursuit.

C. projection.

D. dataset pursuit.

Correct : A. projection pursuit.