DM Unit 5
The gap (in terms of time) between the events can also act as a constraint. The gap can be set to 0 (i.e., gap = 0), to a fixed value, or to a range of values. In the first case, with gap = 0, all the sequential patterns that do not have any gap in between their events are found. In the second case, all the sequential patterns with the exact gap in between their events are found. In the third case, where the gap is defined as a range of values, all the sequential patterns whose events are separated by at least the minimum gap but at most the maximum gap are found.

Constraints Related to the Type of Sequential Patterns
In this type of sequential pattern mining, regular expressions are used for providing pattern templates in the form of serial and parallel episodes. A group of events that occurs in a total order is said to be a serial episode. A parallel episode, on the other hand, refers to a group of events whose order is trivial.

Multilevel sequential patterns can be mined from a transaction database in a manner related to the mining of association rules. An example of such a pattern is "A customer who buys a PC will buy Microsoft software within three months", on which one may drill down to find a more refined pattern, such as "A customer who buys a Pentium PC will buy Microsoft Office within three months".

A multidimensional sequential pattern is a sequence of ordered events that carries additional information about the pattern. For the given example, "A customer who buys a PC will buy Microsoft software within three months", a drill-down operation on this pattern gives a more specific pattern such as "A customer who buys a Pentium PC will buy Microsoft Office within three months". Such a pattern gives more detailed information. Sequential pattern mining methods such as the PrefixSpan algorithm can be used for searching multilevel and multidimensional sequential patterns.

Answer :
Periodicity Analysis
It is the mining of periodic patterns in time series databases.
It is performed for searching repetitive patterns that describe the occurrence of events over regular intervals of time. Due to this property, it is applicable in various fields like weather forecasting, daily traffic analysis, everyday power consumption analysis, occurrence of tides and planet trajectory analysis.

The time-related data that is used for periodicity analysis can be events or sequences of values which have occurred at regular intervals or at varying intervals of time. The data can be related to the frequency of events or to the usage of an item. Event data is also called categorical data, like the occurrence of a storm or buying a computer. Item data is usually numerical data, such as electricity consumption or daily traffic.

A periodic pattern is classified as a full or a partial periodic pattern, based on the coverage of the pattern.

1. Full Periodic Pattern
This is a pattern that takes into account every point in time to be a part of the cyclic behaviour of a time-related sequence. An example of a full periodic pattern is the growth of a child in a year, which can be attributed to every day of that year.

2. Partial Periodic Pattern
This is a pattern where only a few points in time contribute to the cyclic behaviour of a time-related sequence. An example of such a pattern is the daily routine of an individual who does a few specific things at a fixed time every day, but has no regularity in his other activities. These patterns can be related to many cyclic behaviours in the real world.

SPECTRUM ALL-IN-ONE JOURNAL FOR ENGINEERING STUDENTS

Periodicity patterns can also be classified based on how strictly the events are timed. They are called synchronous and asynchronous patterns. A synchronous periodicity pattern requires an event to occur only at a fixed time in each stable period, while in an asynchronous periodicity pattern the time of events can be relaxed to some extent.
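The difference between synchronous and asynchronous patterns can be sketched in code. This is a minimal illustration under stated assumptions, not a standard algorithm: the function name `periodicity_type`, the `tolerance` parameter and the three result labels are hypothetical, and the input is assumed to hold exactly one occurrence of the event per period.

```python
from typing import Sequence


def periodicity_type(event_times: Sequence[int], period: int, tolerance: int) -> str:
    """Classify one recurring event from its occurrence times.

    event_times: absolute times of the event (e.g. minutes since the first
                 midnight), assumed to contain one occurrence per period.
    period:      length of one period in the same unit (e.g. 1440 for a day).
    tolerance:   largest spread of in-period offsets still treated as
                 periodic (a hypothetical tuning knob).
    """
    # Position of each occurrence inside its own period.
    offsets = [t % period for t in event_times]
    # Spread of those positions; wrap-around at the period boundary
    # (e.g. 23:55 versus 00:05) is deliberately ignored in this sketch.
    spread = max(offsets) - min(offsets)
    if spread == 0:
        return "synchronous"    # same fixed time in every period
    if spread <= tolerance:
        return "asynchronous"   # timing relaxed, but within the tolerance
    return "not periodic"
```

With days of 1440 minutes, an event at minute 300 of every day is classified as synchronous, while the same event drifting by 15 minutes on some days is classified as asynchronous when `tolerance` is 20.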
For example, waking up in the morning at 5:00 AM every day is a synchronous periodic pattern, whereas waking up at 5:00 AM on some days but at 5:15 AM on some other days relaxes the timing. A pattern can also be classified as precise or approximate.

Full periodicity analysis on numerical data requires a Fourier transformation of the data, so that the data in the time domain can be transformed to the frequency domain. Similarly, the periodicity analysis of partial periodic patterns for the same example is a time-consuming process, because it requires the details of fixed events to be specified in advance. Therefore, partial periodicity analysis results in the discovery of periodic association rules that associate a set of periodic events for efficient pattern search.

5.4 MINING OBJECT, SPATIAL, MULTIMEDIA, TEXT AND WEB DATA

Q31. Explain about multidimensional analysis and descriptive mining of complex data objects.
Answer : Model Papers, ati(a)
Scientific research and engineering design are advanced data-intensive applications which need to store, index, access and manipulate complex data objects. It is a very difficult task to represent these complex objects as simple and consistent structured records. Application requirements, such as efficient storage of and access to large amounts of disk-based complex structured data objects, motivated the design and development of database systems for such data. There are two kinds of database systems,
1. Object-relational database system
2. Object-oriented database system

In the field of databases, rigorous research is conducted on object-relational and object-oriented database systems to know how efficiently complex objects can be indexed, stored, accessed and manipulated. In these systems, a large group of different data objects is organized into classes. These classes are in turn organized into hierarchies of classes and subclasses.
DATA MINING [JNTU-HYDERABAD]

A class containing objects is associated with,
(i) An identifier for each object (object identifier).
(ii) A set of attributes that can include list-valued data and hierarchies of classes and subclasses.
(iii) A set of methods which is used to specify the rules associated with the object class.

A systematic analysis and mining of large complex structured data objects consists of two important tasks,
(a) Construction of a multidimensional data warehouse for complex object data and performing OLAP operations on it.
(b) Development of efficient methods for performing knowledge discovery by extracting relevant knowledge from different data warehouses. This task is accomplished by extracting particular types of data, which include multimedia, spatial, temporal data, etc.

Limitations of Data Warehouse and OLAP Tools
Following are the limitations of data warehouse and OLAP tools,
1. Multidimensional data analysis is confined to a limited number of data types associated with dimensions and measures.
2. Data cube implementations restrict dimensions and measures to the following,
   • Categorical data
   • Measures which are simple aggregated values like count( ), sum( )

Q32. What is structured data? How can generalization be performed on structured data?
Answer :
Structured Data
Structured data is a type of data that is stored in a structured format, in accordance with specific data models. The data needs to be restructured in several ways, so as to extract implicit patterns that may not be apparent otherwise.

Generalization of Structured Data
The essential characteristic of object-relational and object-oriented databases is that they have the ability to access and model complex structure-valued data. Structured data can be one of the following,
1. Set-valued data
2. List-valued data
3. Complex structure-valued data
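As a minimal sketch of generalizing set-valued data, assume that each value in the set is replaced by a higher-level concept taken from a concept hierarchy. This is one possible approach, stated here as an assumption. The hierarchy contents and the function names `generalize_set` and `summarize` are hypothetical and serve only as an illustration.

```python
from collections import Counter

# Hypothetical concept hierarchy: low-level value -> higher-level concept.
CONCEPT_HIERARCHY = {
    "tennis": "sports",
    "hockey": "sports",
    "violin": "music",
    "chess": "games",
}


def generalize_set(values, hierarchy=CONCEPT_HIERARCHY):
    """Replace each value by its higher-level concept; unknown values are kept."""
    return {hierarchy.get(value, value) for value in values}


def summarize(values, hierarchy=CONCEPT_HIERARCHY):
    """Count how many original values fall under each higher-level concept."""
    return Counter(hierarchy.get(value, value) for value in values)


hobbies = {"tennis", "hockey", "violin"}
print(generalize_set(hobbies))  # the set collapses to its higher-level concepts
print(summarize(hobbies))       # number of original values per concept
```

The generalized set is smaller than the original because several low-level values share one concept, and the counts keep some of the information that the plain generalized set loses.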