Quiznetik

Data Mining | Set 2

1. _________ is an example for case based-learning.

Correct : D. K-nearest neighbor.

2. ___________ percentage of the interesting information can be obtained by using SQL.

Correct : A. 80

3. ________ is the technique which is used for discovering patterns in dataset at the beginning of data mining process.

Correct : B. Visualization.

4. In K-nearest neighbor algorithm K stands for ________.

Correct : A. number of neighbors that are investigated.

5. The complexity of data mining algorithm is represented by ________.

Correct : C. n log n.

6. Genetic algorithm was proposed by _______.

Correct : A. John Holland.

7. ________ is the first stage in genetic algorithm.

Correct : C. Creation of population of string.

8. The _________ is one of genetic operators that are used to recombine the population of genetic material.

Correct : A. genetic operator.

9. _______ is the heart of knowledge discovery in database process.

Correct : D. Creative coding.

10. ______ is a planning optimization application written for KLM

Correct : B. CAPTAINS.

11. EIS stands for _________.

Correct : A. Executive Information System.

12. Foreign key constraints are also referred as _______.

Correct : B. referential integrity.

13. The set of attribute in a database that refers to data in another table is called ______.

Correct : C. foreign key.

14. The distance between two points that is calculated using Pythagoras theorem is _________.

Correct : B. eucledian distance.

15. A database containing volatile data used for daily operation of an organization is ______.

Correct : D. operational data.

16. The system that can be used without knowledge of internal operation _______.

Correct : A. black box.

17. ______ is the relationship between compressibility and learnability.

Correct : B. Minimum description length principle.

18. In KDD and data mining, noise is referred to as ________.

Correct : D. random errors in database.

19. DSS stands for _______.

Correct : B. Decision Support System.

20. Data mining algorithms require ___________

Correct : D. All of the above.

21. The algorithm that need to access a table several times during execution is_______.

Correct : A. n-table scan algorithm.

22. A coding operation in which an attribute with cardinality n is replaced by n binary attributes is called as ______.

Correct : C. flattening of table.

23. The un-normalized relation containing all attributes that exist in database is ______.

Correct : D. universal relation.

24. The technique of learning by generalizing from examples is ________.

Correct : B. inductive learning.

25. The ever increasing amount of data is compared to that of infinite library by Jorge Louis Borges in his short stories namely _________.

Correct : C. the library of Babel.

26. ______ itself has become a production factor of importance.

Correct : B. Information.

27. The _______ plays an important role in artificial intelligence.

Correct : D. learning capabilities.

28. Knowledge discovery in database refers to _____.

Correct : A. whole process of extraction of knowledge from data.

29. Data mining is used to refer ______ stage in knowledge discovery in database.

Correct : C. discovery.

30. Query tools and data mining tools are _______.

Correct : C. complementary.

31. In genetic algorithm the problem is considered in terms of _________.

Correct : D. strings of characters.

32. In UK,_______ has applied data mining techniques to analyze viewing figures. a. a press .

Correct : B. BBC

33. In K- nearest neighbor the input is translated to __________.

Correct : B. points in multidimensional space

34. In machine learning ________ phase try to find the patterns from observations.

Correct : C. analysis

35. __________________refers to the process of deriving high-quality information from text.

Correct : A. Text Mining.

36. The process of selecting good hypothesis and improving the theory based on this is called _______.

Correct : B. hill climbing algorithm.

37. _____________ is the application of data mining techniques to discover patterns from the Web.

Correct : C. Web Mining.

38. It is important to know the complexity of the _______ before developing any machine learning algorithm.

Correct : C. search space

39. Information content is closely related to ______ and transparency.

Correct : D. statistical significance.

40. The ________ is used to express the hypothesis describing the concept.

Correct : A. computer language.

41. A definition of a concept is complete if it recognizes _________.

Correct : B. all the instances of a concept.

42. The results of machine learning algorithms are always have to be checked for their _________.

Correct : D. statistical relevance.

43. A ________ is necessary condition for KDDs effective implement.

Correct : C. data warehouse.

44. The first international KDD conference was held in the year ________.

Correct : A. 1995.

45. AI stands for ____.

Correct : D. artificial intelligence.

46. KDD is a ________.

Correct : B. multidisciplinary field of research.

47. ______ could generate rule automatically.

Correct : B. machine learning.

48. Intelligent miner is a mining tool from _______.

Correct : C. IBM.

49. The organization such as ______ is in USA.

Correct : A. AT & T.

50. ________ is a mining tool from integral solutions.

Correct : D. clementine.

51. ________ % of KDD is about preparing data.

Correct : C. 80

52. The ______ is one of the operation research techniques.

Correct : B. k-nearest neighbor.

53. Everything that science discovers has only ______ value.

Correct : D. temporary.

54. A good introduction to machine learning is the idea of ______.

Correct : A. concept learning.

55. The algorithms that are controlled by human during their execution is _______ algorithm.

Correct : B. supervised.

56. Background knowledge depends on the form of ______________.

Correct : D. knowledge representation.

57. Bias helps to ______.

Correct : D. constrain the search and utilizes KDD to analyze client files.

58. A _____ algorithm takes all the data at once and tries to create a hypothesis based on this data.

Correct : B. batch learning.

59. A ________ algorithm takes a new piece of information at each learning cycle and tries to revise the theory using new data.

Correct : B. batch learning.

60. The _________ forms the background knowledge in the inductive logic programming.

Correct : A. prolog program.

61. In KDD process _______ % is about mining.

Correct : C. 20.

62. ________ is used to find the vaguely known data.

Correct : C. Data mining.

63. A definition of a concept is _______ if it does not classify any negative examples as falling under the concept.

Correct : B. consistent.

64. Lot of kangaroo jumping around the country side is an example for ________.

Correct : A. parallelism.

65. The easiest way to gain access to the data and facilitate effective decision making is to set up a _______.

Correct : C. data warehouse.

66. Smaller local data warehouse is called as ____.

Correct : B. database.

67. Data warehouse is only used for _____.

Correct : D. queries.

68. The _______ data are stored in data warehouse.

Correct : B. historical.

69. A decision support system is a system that ________.

Correct : A. can constantly change over time.

70. Metadata is used by the end users for ______.

Correct : C. querying purposes.

71. The _________ techniques are used to load information from operational database to data warehouse.

Correct : D. replication.

72. The __________ represents the best choice for building a data warehouse.

Correct : A. client/server.

73. The __________ is one of database that operates on massively parallel computer.

Correct : D. tandem.

74. ________ is more recent expert system.

Correct : B. Gasoil.

75. A ______ is not the rule that govern the basic structure of data warehouse.

Correct : B. volatile.

76. The metadata that is generated at the time of building a warehouse is called ______.

Correct : A. Build time metadata.

77. The control metadata is used to _______.

Correct : C. track the sequence and timing of warehouse events.

78. A data warehouse is said to contain a time-varying collection of data because ___.

Correct : C. it contains historical data.

79. A data warehouse is an integrated collection of data because _____.

Correct : B. it is a collection of data derived from multiple sources.

80. Expert systems are ________.

Correct : A. system that contain the knowledge of specialists.

81. _______ is an expert who analyzed the effect of using machine learning algorithm in setting up expert system.

Correct : C. Bratko.

82. The element that is not taken into consideration for cost justification for the implementation of KDD environment is _______.

Correct : B. cost.

83. A ______ is an interactive system that enables decision makers to use database and models on a computer in order to solve ill structured problems.

Correct : C. DSS.

84. The _______ is a symbolic representation of facts or ideas from which information can potentially be extracted.

Correct : B. data.

85. DB/2 is a family of RDBMS marketed by _____.

Correct : C. IBM.

86. A collection of interesting and useful patterns in database is called _______.

Correct : A. knowledge.

87. In data mining software that works on local workstation is used to _______.

Correct : B. generate screen and reports for the end user.

88. A ________ acts a bridge between data warehouse and database application.

Correct : C. meta data.

89. The _____ operation is used for reducing data cube by one or more dimensions.

Correct : D. slicing.

90. The main organizational justification for implementing a data warehouse is to provide ______.

Correct : C. storing large volume of data.

91. KDD consists of ______ stages.

Correct : C. six.

92. _______ is the first stage in KDD process.

Correct : A. Data selection.

93. The term that is not associated with data cleaning process is ______.

Correct : D. segmentation.

94. In _______ process of KDD additional information can be added to the existing data.

Correct : A. enrichment.

95. _______ is a type of coding operation that occurs frequently in KDD context.

Correct : C. Flattening.

96. SQL stands for ________.

Correct : B. structured query language.

97. _________ is one of the traditional query tool.

Correct : D. SQL.

98. The _____ is a useful method of discovering patterns at the beginning of data mining process.

Correct : B. visualization techniques.

99. A/An_____ is an object oriented 3D tool kit which enables the user to explore 3D structure.

Correct : A. inventor.

100. The field of research dedicated to the search for interesting projections of datasets are called __________.

Correct : A. projection pursuit.