Data Warehousing & Mining: Unit - V
Data Warehousing & Mining: Unit - V
UNIT – V
7
Prof. S.K. Pandey, I.T.S, Ghaziabad
Limitations of Web Mining
Web mining the technology itself doesn’t create issues, but this technology when used on data of
personal nature might cause concerns.
The most criticized ethical issue involving web mining is the invasion of privacy.
Privacy is considered lost when information concerning an individual is obtained, used, or
disseminated, especially if this occurs without their knowledge or consent. The obtained data will be
analyzed, and clustered to form profiles; the data will be made anonymous before clustering so that no
individual can be linked directly to a profile. But usually the group profiles are used as if they are
personal profiles.
Thus these applications de-individualize the users by judging them by their mouse clicks. De-
individualization, can be defined as a tendency of judging and treating people on the basis of group
characteristics instead of on their own individual characteristics and merits.
Another important concern is that the companies collecting the data for a specific purpose might use the
data for a totally different purpose, and this essentially violates the user’s interests. The growing trend
of selling personal data as a commodity encourages website owners to trade personal data obtained from
their site. This trend has increased the amount of data being captured and traded increasing the
likeliness of one’s privacy being invaded.
The companies which buy the data are obliged make it anonymous and these companies are considered
authors of any specific release of mining patterns. They are legally responsible for the contents of the
release; any inaccuracies in the release will result in serious lawsuits, but there is no law preventing
them from trading the data.