Printer Friendly
The Free Dictionary
1,080,885,824 visitors served.
?
Dictionary/
thesaurus
Medical
dictionary
Legal
dictionary
Financial
dictionary
Acronyms
 
Idioms
Encyclopedia
Wikipedia
encyclopedia
?

Data mining
(redirected from Knowledge Discovery in Databases)

   Also found in: Financial, Acronyms, Wikipedia, Hutchinson 0.07 sec.

data mining

Type of database analysis that attempts to discover useful patterns or relationships in a group of data. The analysis uses advanced statistical methods, such as cluster analysis, and sometimes employs artificial intelligence or neural network techniques. A major goal of data mining is to discover previously unknown relationships among the data, especially when the data come from different databases. Businesses can use these new relationships to develop new advertising campaigns or make predictions about how well a product will sell. Governments also use these techniques to discern illegal or embargoed activities by individuals, associations, and other governments.


Exploring and analyzing detailed business transactions. It implies "digging through tons of data" to uncover patterns and relationships contained within the business activity and history. Data mining can be done manually by slicing and dicing the data until a pattern becomes obvious. Or, it can be done with programs that analyze the data automatically. Data mining has become an important part of customer relationship management (CRM). In order to better understand customer behavior and preferences, businesses use data mining to wade through the huge amounts of information gathered via the Web. See data miner, Web mining, OLAP, DSS, EIS, data warehouse and slice and dice.

Doing It Automatically
The goal of this credit card analysis is to determine the most influential factors common to non-profitable customers. In this case, BusinessMiner from Business Objects determined that the credit limit had the greatest effect on profitability and prioritized the results in graphical form. (Screen shot courtesy of Business Objects.)


Data mining

The development of computational algorithms for the identification or extraction of structure from data. This is done in order to help reduce, model, understand, or analyze the data. Tasks supported by data mining include prediction, segmentation, dependency modeling, summarization, and change and deviation detection. Database systems have brought digital data capture and storage to the mainstream of data processing, leading to the creation of large data warehouses. These are databases whose primary purpose is to gain access to data for analysis and decision support. Traditional manual data analysis and exploration requires highly trained data analysts and is ineffective for high dimensionality (large numbers of variables) and massive data sets. See Database management system

A data set can be viewed abstractly as a set of records, each consisting of values for a set of dimensions (variables). While data records may exist physically in a database system in a schema that spans many tables, the logical view is of concern here. Databases with many dimensions pose fundamental problems that transcend query execution and optimization. A fundamental problem is query formulation: How is it possible to provide data access when a user cannot specify the target set exactly, as is required by a conventional database query language such as SQL (Structured Query Language)? Decision support queries are difficult to state. For example, which records are likely to represent fraud in credit card, banking, or telecommunications transactions? Which records are most similar to records in table A but dissimilar to those in table B? How many clusters (segments) are in a database and how are they characterized? Data mining techniques allow for computer-driven exploration of the data, hence admitting a more abstract model of interaction than SQL permits.

Data mining techniques are fundamentally data reduction and visualization techniques. As the number of dimensions grows, the number of possible combinations of choices for dimensionality reduction explodes. For an analyst exploring models, it is infeasible to go through the various ways of projecting the dimensions or selecting the right subsamples (reduction along columns and rows). Data mining is based on machine-based exploration of many of the possibilities before a selected reduced set is presented to the analyst for feedback.


(database)data mining - Analysis of data in a database using tools which look for trends or anomalies without knowledge of the meaning of the data. Data mining was invented by IBM who hold some related patents.

Data mining may well be done on a data warehouse.

ShowCase STRATEGY is an example of a data mining tool.


How to thank TFD for its existence? Tell a friend about us, add a link to this page, add the site to iGoogle, or visit webmaster's page for free fun content.
?Page tools
Printer friendly
Cite / link
Email
Feedback
? Mentioned in ? References in periodicals archive
 
Knowledge discovery in databases (KDD) has become a hot topic in recent years.
Fayyad was recognized for his extensive contributions to the fields of machine learning and data mining and for his significant scientific and commercial applications in the field of knowledge discovery in databases.
Research publications in library and information science have been implicitly related to knowledge discovery in databases (KDD) in terms of methods and techniques, though many of them did not use the terminology "knowledge discovery in databases" explicitly.
 
Encyclopedia browser? ? Full browser
 
 
Encyclopedia
?

Disclaimer | Privacy policy | Feedback | Copyright © 2008 Farlex, Inc.
All content on this website, including dictionary, thesaurus, literature, geography, and other reference data is for informational purposes only. This information should not be considered complete, up to date, and is not intended to be used in place of a visit, consultation, or advice of a legal, medical, or any other professional.. Terms of Use.