L_1 Data Mining
L_1 Data Mining
Data Distribution:
Real-worlds data is usually stored on various platforms in a
distributed computing environment. It might be in a
database, individual systems, or even on the internet.
Practically, It is a quite tough task to make all the data to a
centralized data repository mainly due to organizational and
technical concerns. For example, various regional offices
may have their servers to store their data. It is not feasible to
store, all the data from all the offices on a central server.
Therefore, data mining requires the development of tools and
algorithms that allow the mining of distributed data.
Complex Data:
Real-world data is heterogeneous, and it could be
multimedia data, including audio and video, images,
complex data, spatial data, time series, and so on. Managing
these various types of data and extracting useful information
is a tough task. Most of the time, new technologies, new
tools, and methodologies would have to be refined to obtain
specific information.
Performance:
The data mining system's performance relies primarily on the
efficiency of algorithms and techniques used. If the designed
algorithm and techniques are not up to the mark, then the
efficiency of the data mining process will be affected
adversely.
Data Privacy and Security:
Data mining usually leads to serious issues in terms of data
security, governance, and privacy. For example, if a retailer
analyzes the details of the purchased items, then it reveals
data about buying habits and preferences of the customers
without their permission.
Data Visualization:
In data mining, data visualization is a very important process
because it is the primary method that shows the output to the
user in a presentable way. The extracted data should convey
the exact meaning of what it intends to express. But many
times, representing the information to the end-user in a
precise and easy way is difficult. The input data and the
output information being complicated, very efficient, and
successful data visualization processes need to be
implemented to make it successful.
There are many more challenges in data mining in addition
to the problems above-mentioned. More problems are
disclosed as the actual data mining process begins, and the
success of data mining relies on getting rid of all these
difficulties.
Prerequisites
Before learning the concepts of Data Mining, you should
have a basic understanding of Statistics, Database
Knowledge, and Basic programming language.
Audience
Our Data Mining Tutorial is prepared for all beginners or
computer science graduates to help them learn the basics to
advanced techniques related to data mining.
Problems
We assure you that you will not find any difficulty while
learning our Data Mining tutorial. But if there is any mistake
in this tutorial, kindly post the problem or error in the contact
form so that we can improve it.