Data mining is used to extract knowledge from huge amount of the data Today, Data mining helps different organizations focus on customer’s behavior patterns. The research scope of data mining extended in various fields. This paper, discusses the concept of data mining, important issues and applications. So there comes the need of powerful and most importantly automatic tools for uncovering valuable slots of organized information from tremendous amount of data. Considering social networking site or a search engine, they receive millions of queries every day. Firstly, the Database Management Systems evolved to handle the queries of similar types. Then the approach was modified to advanced Database management system, Data warehousing and Data mining for advance data analysis and web based databases. Data mining has immensely penetrated in each and every field of day to day life
Introduction
The adage "We are living in the era of information" underscores the vast amount of data generated today. However, raw data alone is insufficient; it must be processed and refined to extract meaningful information. This process is known as Knowledge Discovery in Databases (KDD), often referred to as data mining. Data mining involves analyzing large datasets to identify patterns, correlations, and trends that can inform decision-making.
What Is Data Mining?
According to the Gartner Group, data mining is the process of discovering meaningful correlations, patterns, and trends by sifting through large amounts of data stored in repositories, using pattern recognition technologies as well as statistical and mathematical techniques. gartner.com
Data mining is an interdisciplinary field that combines techniques from machine learning, pattern recognition, statistics, databases, and visualization to extract useful information from large datasets.
Why Use Data Mining?
Organizations utilize data mining for several reasons:
Too much data and too little information: Organizations accumulate vast amounts of data but often lack the means to extract actionable insights.
Need to extract useful information: Data mining helps in interpreting complex datasets to derive meaningful information.
Competitive pressure: In a competitive business environment, organizations use data mining to gain a competitive edge.
Quality and timely response: Data mining enables organizations to respond quickly and effectively to changing conditions.
History of Data Mining
The term "data mining" emerged in the 1990s, but its roots trace back to classical statistics, artificial intelligence, and machine learning. Data mining represents the convergence of these fields, adapting machine learning techniques for business applications.
Issues in Data Mining
Several challenges accompany data mining:
Security and social issues: The collection and analysis of large datasets can raise privacy concerns and potential misuse of sensitive information.
User interface issues: Effective visualization and user-friendly interfaces are crucial for interpreting data mining results.
Mining methodology issues: Selecting appropriate mining techniques and handling diverse data types can be complex.
Performance issues: Processing large datasets efficiently requires scalable algorithms and computational resources.
Data source issues: The diversity and volume of data sources pose challenges in data integration and analysis.
Dependency Modeling: Understanding relationships between variables.
Deviation Detection: Identifying anomalies.
Summarization: Providing compact representations of data.
Conclusion
Data mining is to discover or extract knowledge or data from large amount of database. In this paper, we introduced briefly reviewed concept of data mining, issues of data mining and areas of data mining where used today. It would be helpful to researchers to focus on the various issues and challenges of data mining. Data mining software is one of a number of analytical tools for analyzing data. It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified. From the last decades, data mining and knowledge discovery applications have important significance in decision making and it has become an essential component in various organizations and fields. The complexity of data mining must be hidden from end-users before it will take the true center stage in an organization. Business use cases can be designed, with tight constrains, around data mining algorithms. Due to the enormous success of various application areas of data mining, the field of data mining has been establishing itself as the major discipline of computer science and has shown interest potential for the future developments.
References
[1] Osmar R. Zaiane, “Principles of Knowledge Discovery in Databases”, CMPUT690, University of Alberia.
[2] Han and M. Kamber. Data Mining, “Concepts and Techniques”, Morgan Kaufmann, 2000.
[3] Bharati M. Ramageri, “Data Mining Techniques and Applications”, Indian Journal of Computer Science and Engineering, Vol. 1 No. 4 301-305
[4] Alex Berson, Stephen Smith, and Kurt Thearling, “Building Data Mining Applications for CRM”.
[5] Barbara, Daniel; Jajodia, Sushil (Eds.), “Applications of Data Mining in Computer Security”
[6] Gary M. Weiss, “Data Mining in the Telecommunications Industry”, Fordham University, USA.
[7] Khalid Raza, “Application of Data Mining in Bioinformatics”, “Indian Journal of Computer Science and Engineering”