Automatic Semantic Classification and Categorization of Web Services in Digital Environment

The document proposes a framework for automatically classifying and categorizing web services in a digital environment through semantic analysis. The framework uses a semantic web crawler to retrieve service information from websites. It annotates the service metadata using ontologies and stores it in a database. The classified service metadata can then be queried to discover and categorize relevant web services based on domain knowledge. The goal is to improve upon existing web service discovery approaches by enabling automated semantic classification of services.


Mr. V. V. Sawant
M.E. Student, CSE Department, D. Y. Patil College of Engineering & Technology, Kolhapur, Maharashtra, India.
[email protected]

Dr. V. R. Ghorpade
Professor & Principal, D. Y. Patil College of Engineering & Technology, Kolhapur, Maharashtra, India
[email protected]

Abstract - Classification of web services through semantic service discovery groups services with similar features. However, this alone is not enough to improve the selection and matching process. Existing service discovery approaches often rely on keyword matching to find web services. In this paper we propose a framework for automatic service classification and categorization of web services in a digital environment. The proposed framework semantically performs automated service discovery and domain selection using domain-knowledge, ontology-based classification in a digital environment to improve service categorization. It is able to efficiently classify and annotate service information by means of specific service domain knowledge. In order to thoroughly evaluate the performance of our proposed semantic crawler for automatic service discovery, we measure the Precision, Mean Average Precision, Recall and F-measure rates.

Keywords: Service Categorization, Web Service discovery, Semantic Classification, Digital environment, Ontology Structure.

1. INTRODUCTION

A large number of digitally distributed applications built on the Web services architecture facilitate the creation of service-oriented architectures in the web environment [6]. Among other things, these web services support communication and the growth of e-commerce and marketing, and offer various functionalities in the area of utilities. Some of the web services are published and implemented in-house by various organizations [7][8]. These web services are used for professional applications, and possibly in government and the military. However, this structure requires careful selection of appropriate web services [9][10]. Service providers are listed in a web service registry under predefined categories. Consequently, similar services can be listed under different categories.

The emergence of digital business can be attributed to the natural existence of business ecosystems [11][12], along with the evolution of business networks and information technology. Depending on its needs, a participant in a web services digital network can play two roles: service provider, acting as a server, or service customer. However, providing web services currently faces the following problems.

• The service provider has to mark up the Semantic Web uniform resource identifiers (URIs) that the description language will use.
• Instances of online service pages are very difficult to obtain, so retrieving information from the existing pages is very difficult, and that information is not annotated as semantic services.
• Some service providers rely on semantic annotation based on domain knowledge, which yields very little useful information. There is no proper method for this work, and under the existing retrieval principles the service information is not ubiquitous.

To solve the above problems of the existing systems, we propose an automatic framework that semantically discovers and categorizes web services. It is intended to resolve the problems listed above.

The rest of the paper is organized as follows. Section 2 presents related work. Section 3 presents the proposed automatic semantic classification and categorization of web services. Sections 4 and 5 present the experiment and the result analysis. Finally, Section 6 summarizes the work in a conclusion.

2. RELATED WORKS

The existing approaches focus on only one of the roles, either the service requester or the service provider [1][2][13]. A service provider enters a digital environment by publishing a service entity, which is stored in distributed service knowledge bases [4]. These service entities are stored in the form of service metadata [5].
Metadata abstraction focused crawlers are focused crawlers that can abstract meaningful information from relevant web pages and annotate the information with ontology markup languages [15]. Francesconi and Peruginelli [18] proposed a metadata abstraction focused crawler for a vertical portal system, which is a management system for legal documents. Giles et al. [19] proposed a metadata abstraction focused crawler for a niche e-business information search engine.

Tane et al. [14] proposed an ontology-based focused crawler for an ontology management system, Courseware Watchdog. Ganesh et al. [15] proposed an association metric, with the purpose of optimizing the order of visited URLs for web crawlers. THESUS [16] aims to organize online documents by linking their URLs to hierarchical ontology concepts, which are seen as thematic subsets.

3. AUTOMATIC SEMANTIC CLASSIFICATION AND CATEGORIZATION OF WEB SERVICES

The proposed framework, shown in Figure-1, addresses the existing problems through web crawling. The crawler is able to retrieve information about service entities from the Web, which corresponds to the functionality of service discovery. It is able to annotate the service information semantically and to store the semantic service information, which corresponds to the functionality of service annotation. It is able to filter and classify the annotated service information by means of specific service ontology domain knowledge, which corresponds to the functionality of service classification.

Figure-1: Framework for automatic semantic classification and categorization of web services

3.1 Service Metadata Generator

The Service Metadata Generator provides the bottom-level concepts, which have the linked service metadata property and are therefore allowed to associate with service metadata. The generated service metadata is stored in the service metadata database, and the predefined service domain ontologies are stored in the service ontology structure database. The service domain ontology provides a hierarchy of service concepts, in which each concept is an abstraction of the service entities that share some common features. The concepts are related by the concept-subconcept relationship, in which a concept is the abstraction of its subconcepts. For the purposes of metadata classification, only the bottom-level service concepts have the privilege of associating with service metadata, as a higher-level concept comprises all the metadata associated with its subconcepts.

The service metadata generator collects web pages from the web and uses the data to build the tiered ontology framework metadata database, which supports the domain service webpage process. This process collects a webpage from the web and classifies it by service and domain. Using that data, it extracts the metadata from the page and sends it to the service metadata generator, which collects and processes the metadata and stores it in the service ontology structure database.
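As an illustration of the collection flow described in this section, together with the fetcher, policy center and webpage pool of Section 3.2, the following minimal Python sketch walks pages through a policy check, plain-text conversion and metadata extraction. It is not the implementation used in this work. The names PageParser, policy_allows and crawl, the use of the meta keywords tag as service metadata, the host-based crawl boundary and the in-memory FAKE_WEB pages are all assumptions made for the example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class PageParser(HTMLParser):
    """Collects links, meta keywords and visible text from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links, self.keywords, self.text = [], [], []
        self._skip = 0  # nesting depth inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("script", "style"):
            self._skip += 1
        elif tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag == "meta" and (attrs.get("name") or "").lower() == "keywords":
            # Assumption: service metadata is taken from the meta keywords tag.
            content = attrs.get("content") or ""
            self.keywords += [k.strip().lower() for k in content.split(",") if k.strip()]

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip and data.strip():
            self.text.append(data.strip())


def policy_allows(url, allowed_hosts, seen):
    """Policy-center stand-in: stay inside the crawl boundary, skip repeats."""
    return urlparse(url).netloc in allowed_hosts and url not in seen


def crawl(seed, fetch, allowed_hosts, max_pages=10):
    """Return {url: {"text": ..., "metadata": [...]}} for pages reachable from seed."""
    queue, seen, pool = [seed], set(), {}
    while queue and len(pool) < max_pages:
        url = queue.pop(0)
        if not policy_allows(url, allowed_hosts, seen):
            continue  # URL rejected by the policy check, page is discarded
        seen.add(url)
        html = fetch(url)
        if html is None:
            continue
        parser = PageParser()
        parser.feed(html)
        # Webpage pool stand-in: keep the tag-free text and the extracted metadata.
        pool[url] = {"text": " ".join(parser.text), "metadata": parser.keywords}
        queue.extend(urljoin(url, href) for href in parser.links)
    return pool


# Hypothetical pages standing in for the web, so the sketch runs offline.
FAKE_WEB = {
    "http://example.org/": (
        '<html><head><meta name="keywords" content="Eye Hospital, Eye Surgery">'
        "<style>p{}</style></head><body><p>Best eye care</p>"
        '<a href="/about">About</a><a href="http://other.example/">x</a></body></html>'
    ),
    "http://example.org/about": "<html><body><p>About us</p></body></html>",
}
pool = crawl("http://example.org/", FAKE_WEB.get, {"example.org"})
```

Passing the fetch function as a parameter keeps the sketch independent of any particular HTTP client; a real fetcher could be substituted without changing the loop.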
3.2 Metadata Generation using Semantic Web crawler

The Semantic Metadata Web Crawler accepts a URL as input and provides metadata as output. It consists of several internal processes: webpage fetcher, policy center, webpage pool, webpage parser, service metadata generator and service metadata. The webpage fetcher takes the URL as input and fetches the webpage, which is sent to two sub-processes: the policy center and the webpage pool. The policy center takes the webpage, extracts its URLs, checks them against the deftness and crawler rules, and returns the accepted URLs as new input. The webpage pool then does two things: it converts the webpage to plain text and stores it as a text file, and it sends the text to the webpage parser.

The process relies on the webpage fetcher, policy center and webpage parser. The webpage fetcher fetches pages from the web, extracts the URLs from each webpage and sends them to the policy center. The policy center checks each URL against the domain service knowledge and the crawler boundaries. If the URL satisfies them, the page is fetched and stored in the webpage pool repository; otherwise the webpage is discarded. In the webpage pool, the embedded tags are removed from the webpage, the page is converted to plain text, and information snippets are extracted from the plain text.

3.3 Semantic Categorization of the Web Services

The classification and annotation service works on the basis of the user query. It gets the metadata from the Services Metadata Database and classifies the resulting web pages: the resources requested by the user are matched against the data of the fetched pages, each page's data is annotated against the user request and ranked, and the page links are returned to the user, using the algorithm defined in [3]. It processes the metadata collected from the metadata generator together with the metadata from the service ontology structure data, compares their semantic similarity, identifies the service category and provides a list of web services as output.

4. EXPERIMENT

To perform the experimental evaluation, we generate synthetic metadata using online web services and create a database of the service ontology structure, as discussed below.

4.1 Service Ontology Structure

A well-defined service ontology structure is built on a set of identified domains, as shown in Figure-2. It is defined to support the automatic classification and annotation of web pages by association with the ontology structure.
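The concept-subconcept rule of Section 3.1 and the service ontology structure of Section 4.1 can be pictured with a small data structure. The following Python sketch is only an illustration under stated assumptions: the Concept class and its method names are hypothetical, and the example uses the Travel category with two of its subcategories and two links taken from Table-2. It shows that only bottom-level concepts accept metadata, while a higher-level concept comprises the metadata of its subconcepts.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """One node of the service ontology (hypothetical structure)."""
    name: str
    children: list = field(default_factory=list)
    metadata: list = field(default_factory=list)  # used on bottom-level concepts only

    def is_bottom_level(self):
        return not self.children

    def associate(self, link):
        """Attach a service link; only bottom-level concepts accept metadata."""
        if not self.is_bottom_level():
            raise ValueError(f"{self.name!r} is not a bottom-level concept")
        self.metadata.append(link)

    def all_metadata(self):
        """A higher-level concept comprises the metadata of its subconcepts."""
        if self.is_bottom_level():
            return list(self.metadata)
        return [m for child in self.children for m in child.all_metadata()]


# Category and subcategory names follow Table-2; the structure itself is assumed.
roadway = Concept("Roadway")
airway = Concept("Airway")
travel = Concept("Travel", children=[roadway, airway])

roadway.associate("http://www.redbus.in/")
airway.associate("http://www.airindia.com/")
travel_links = travel.all_metadata()
```

Keeping metadata on the leaves only means a link is stored once, and every ancestor category sees it through aggregation rather than duplication.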

Figure-2: Designed Web Service Ontology Structure

4.2 Semantic Based Crawling and Categorization of Web Services

We process 1000 different web service pages offline to generate metadata. The generated metadata is used to build the web service ontology structure, with semantic association relations among the extracted metadata. The generated service metadata is stored in a database for automatic semantic classification and categorization of the web services. Table-1 shows the generated metadata, its semantic classification relation and the categorization obtained during implementation. Table-2 shows the categorized web service links in relation to the service ontology structure.
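To make the idea of a semantic matching percentage and a support threshold concrete, the following Python sketch scores page metadata against the terms of each bottom-level concept and keeps the best match above a threshold. This is a simple term-overlap measure chosen for illustration; it is not the matching algorithm of [3] and not necessarily the computation used in this work. The function names, the ONTOLOGY terms and the default threshold are assumptions; the page terms are the first words of the Webpage1.txt metadata in Table-1.

```python
def matching_percent(page_terms, concept_terms):
    """Share of the concept's terms that occur in the page metadata, in percent."""
    page, concept = set(page_terms), set(concept_terms)
    if not concept:
        return 0.0
    return 100.0 * len(page & concept) / len(concept)


def categorize(page_terms, ontology, threshold=50.0):
    """Return (category, subcategory, score) for the best concept, or None."""
    best = None
    for category, subconcepts in ontology.items():
        for sub, terms in subconcepts.items():
            score = matching_percent(page_terms, terms)
            # Keep only matches at or above the support threshold.
            if score >= threshold and (best is None or score > best[2]):
                best = (category, sub, score)
    return best


# Hypothetical concept terms; a real ontology would be far richer.
ONTOLOGY = {
    "Travel": {
        "Airway": ["flight", "airlines", "airfares"],
        "Railway": ["train", "railway", "pnr"],
    },
    "HealthCare": {"Eye Care": ["eye", "lasik", "hospital"]},
}

page = "airline tickets airlines booking flight tickets airfares india flight ticket".split()
result = categorize(page, ONTOLOGY)
```

Raising the threshold makes the classifier stricter: pages with weak overlap are left uncategorized instead of being assigned to a poorly matching concept.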
Table-1: Classified WebPages based on Metadata Generated

Webpage: Webpage1.txt
Metadata Generated: airline tickets airlines booking flight tickets airfares india flight ticket bookings flight bookings mumbai delhi flights delhi mumbai flights air tickets online flight booking book flight ticket flight search kingfisher airlines
Semantic Classification: SEMANTIC MATCHING (%): 100%; CATID: 3 - Travel; SUB CATID: 3 - Airway; SLINK: http://www.flykingfisher.com/

Webpage: Webpage3.txt
Metadata Generated: ebay electronics cars clothing apparel collectibles sporting goods digital cameras antiques tickets jewelry online shopping auction online auction
Semantic Classification: SEMANTIC MATCHING (%): 100%; CATID: 2 - E-Business; SUB CATID: 3 - General Items; SLINK: http://www.ebay.in/

Webpage: Webpage30.txt
Metadata Generated: best eye hospital best eye doctor eye specialist eye surgery eye clinic eye treatment eye hospital eye operation eye institute lasik surgery lasik treatment
Semantic Classification: SEMANTIC MATCHING (%): 100%; CATID: 1 - HealthCare; SUB CATID: 1 - Eye Care; SLINK: http://www.vasaneye.in/

Webpage: Webpage46.txt
Metadata Generated: spun yarn cotton combed cotton yarn price carded cotton polyester yarn viscose yarn pv pc acrylic spun yarn dyed yarn cotton yarn price organic cotton yarn cotton yarn exports pima yarn price trends importing countries count wise exports fob prices on core spun yarn
Semantic Classification: SEMANTIC MATCHING (%): 97%; CATID: 4 - Industries; SUB CATID: 2 - Textile Industries; SLINK: http://www.yarnsandfibers.com/

Webpage: Webpage7.txt
Metadata Generated: security equipments manufacturer india security equipments manufacturer mumbai metal detectors manufacturer india metal detectors manufacturer mumbai walkthrough metal detectors (dfmd/hhmd) manufacturer india
Semantic Classification: SEMANTIC MATCHING (%): 90%; CATID: 2 - E-Business; SUB CATID: 2 - Security Equipments; SLINK: http://www.samarthsec.com/

Table-2: Categorized Web Services based Services

Services | Categorized Service List | Links
HealthCare | Eye Care | http://www.preetihospital.com/
HealthCare | Eye Care | http://www.ezsoft-sol.com/
HealthCare | Eye Care | http://www.vasaneye.in/
HealthCare | Cancer care | http://www.apollohospitals.com/
HealthCare | General Care | http://www.apollohospitalsnoida.com/
HealthCare | General Care | http://www.conquercancerfoundation.org/
HealthCare | General Care | http://www.owaisihospital.com/
HealthCare | General Care | http://www.carehospitals.com/
E-Business | Industry Equipments | http://www.starcnc.com/
E-Business | Industry Equipments | http://www.khuranadies.com/
E-Business | Industry Equipments | http://www.technomechmachinetools.com/
E-Business | Security Equipments | http://www.zicom.com/
E-Business | Security Equipments | http://www.samarthsec.com/
E-Business | General Items | http://www.quikr.com/
E-Business | General Items | http://www.jabong.com/
E-Business | General Items | http://www.ebay.in/
E-Business | General Items | http://www.flipkart.com/
Travel | Roadway | http://www.ksrtc.in/
Travel | Roadway | http://www.mybustickets.in/
Travel | Roadway | http://www.redbus.in/
Travel | Roadway | http://www.apsrtconline.in/
Travel | Railway | http://www.olx.in/
Travel | Railway | http://www.trainenquiry.com/
Travel | Railway | http://www.indianrail.gov.in/
Travel | Railway | http://www.indianrailways.gov.in/
Travel | Airway | http://www.jetairways.com/
Travel | Airway | http://www.flykingfisher.com/
Travel | Airway | http://www.airindia.com/
Industries | Steel Industries | http://www.bajajsteel.net/
Industries | Steel Industries | http://www.jyotisteel.com/
Industries | Textile Industries | http://www.vtiltowels.com/
Industries | Textile Industries | http://www.yarnsandfibers.com/
Industries | Plastic Industries | http://www.plasticcontainers.co.in/
Industries | Plastic Industries | http://www.calcuttaplastics.in/

5. RESULT ANALYSIS

To evaluate the effectiveness of the proposed system we measure the precision rate, recall rate, harvest rate, fallout rate and F1-measure as defined in [20]. The computed measures are shown in Figures 3, 4, 5, 6 and 7. The results show an improvement in the precision rate, harvest rate and F1-measure of web service retrieval and a reduction in the recall and fallout rates, which indicates the improvement achieved by the proposal. The analysis is made on the 1000 web pages obtained from the web with different domain queries. Varying the support threshold association value from 10 to 100, we compute the measures from the number of relevant data semantically matched, the number of relevant data not retrieved and the number of non-relevant data retrieved.

Figure-6: Harvest Rate at different threshold value

Figure-7: F-Measure Rate at different threshold value
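For reference, the measures of this section can be computed from the three counts named in the analysis using their standard information retrieval definitions. The Python sketch below is a minimal illustration, not the evaluation code used in this work. The function and parameter names and the sample counts are hypothetical. Fallout additionally needs the total number of non-relevant items, which is passed as an assumed extra input, and harvest rate is omitted because its formula is not stated here.

```python
def precision(matched, nonrelevant_retrieved):
    """Relevant items retrieved divided by all items retrieved."""
    retrieved = matched + nonrelevant_retrieved
    return matched / retrieved if retrieved else 0.0


def recall(matched, relevant_not_retrieved):
    """Relevant items retrieved divided by all relevant items."""
    relevant = matched + relevant_not_retrieved
    return matched / relevant if relevant else 0.0


def f1_measure(p, r):
    """Harmonic mean of precision and recall."""
    return 2 * p * r / (p + r) if (p + r) else 0.0


def fallout(nonrelevant_retrieved, nonrelevant_total):
    """Non-relevant items retrieved divided by all non-relevant items.

    Assumption: nonrelevant_total is known; it is not one of the three counts above.
    """
    return nonrelevant_retrieved / nonrelevant_total if nonrelevant_total else 0.0


# Hypothetical counts at a single threshold value, for illustration only.
matched, relevant_not_retrieved, nonrelevant_retrieved = 60, 40, 20
p = precision(matched, nonrelevant_retrieved)
r = recall(matched, relevant_not_retrieved)
f1 = f1_measure(p, r)
fo = fallout(nonrelevant_retrieved, nonrelevant_total=200)
```

Repeating the computation for each support threshold value gives one point per threshold on each curve.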

Figure-3: Precision Rate at different threshold value

Figure-4: Recall Rate at different threshold value

Figure-5: Fallout Rate at different threshold value

6. CONCLUSION

This paper presents a framework for automatic service discovery in a huge digital web environment. It proposes an efficient approach to building a service ontology structure and classifying web services into categories to improve service for users. To achieve the goal of automatic semantic service discovery and classification, it presents a framework model consisting of a semantic web crawler, a metadata generator, and semantic classification and categorization.

The main function of the proposed work is to semantically discover service information from web pages by parsing, annotating and storing it, so that it can be used for the classification and categorization of web services based on specific service ontology domain knowledge. This approach defines a general format for service metadata and service concepts, which enables similarity computation and the association between metadata and concepts. The experimental results show that increasing the support threshold value can reduce the amount of associated and non-relevant metadata, and that relatively higher support threshold values can benefit the overall performance of automatic service discovery.


REFERENCES

[1]. Aabhas V. Paliwal, Basit Shafiq, Jaideep Vaidya, Hui Xiong, Nabil Adam, "Semantics-based automated service discovery", IEEE Transactions On Services Computing, Vol. 5, No. 2, June 2012, pp. 260-275.
[2]. Joshi A, Patel C, Supekar K, "Swoogle: A meta-data search engine for the semantic web & OntoKhoj: A semantic web portal for ontology searching, ranking and classification", 19th ACM Conf. Information and Knowledge Management, Nov. 2009, pp. 652-668.
[3]. A. Sajjanhar, J. Hou, and Y. Zhang, "Algorithm for Web Services Matching", Proc. Asia-Pacific Web Conference (APWeb), pp. 665-670, 2004.
[4]. E. Chang and M. West, "Digital ecosystem-A next generation of the collaborative environment," in Proc. iiWAS, Yogyakarta, Indonesia, 2006, pp. 3-24.
[5]. H. Boley and E. Chang, "Digital Ecosystems: Principles and semantics," in Proc. IEEE DEST, Cairns, Australia, 2007, pp. 398-403.
[6]. J. L. M. Lastra and M. Delamer, "Semantic web services in factory automation: Fundamental insights and research roadmap," IEEE Trans. Ind. Informat., vol. 2, no. 1, pp. 1-11, Feb. 2006.
[7]. P. Malone, "DE services in Ecosystem Oriented Architectures," in Digital Business Ecosystems, F. Nachira, P. Dini, A. Nicolai, M. L. Louarn, and L. R. Lèon, Eds: Eur. Commission, 2007.
[8]. H. Dong, F. K. Hussain, and E. Chang, "A service search engine for the industrial digital ecosystems," IEEE Trans. Ind. Electron., vol. 58, no. 6, pp. 2183-2196, Jun. 2011, DOI: 10.1109/TIE, 2009.
[9]. A. C. Weaver and M. W. Condry, "Distributing Internet services to the network's edge," IEEE Trans. Ind. Electron., vol. 50, no. 3, pp. 404-411, Jun. 2003.
[10]. M. Wu, J.-H. She, G.-X. Zeng, and Y. Ohyama, "Internet-based teaching and experiment system for control engineering course," IEEE Trans. Ind. Electron., vol. 55, no. 6, pp. 2386-2396, Jun. 2008.
[11]. H. Dong, F. K. Hussain, and E. Chang, "State of the art in semantic focused crawlers," in Proc. ICCSA, Yongin, Korea, 2009, pp. 890-904.
[12]. H. Dong, F. K. Hussain, and E. Chang, "A survey in semantic web technologies-inspired focused crawlers," in Proc. 3rd ICDIM, East London, U.K., 2008, pp. 934-936.
[13]. M. Yuvarani, N. C. S. N. Iyengar, and A. Kannan, "Scrawled: A framework for an enhanced focused web crawler based on link semantics," in Proc. IEEE/WIC/ACM Int. Conf. WI, 2006, pp. 794-800.
[14]. J. Tane, C. Schmitz, and G. Stumme, "Semantic resource management for the web: An e-learning application," in Proc. WWW, NY, 2004, pp. 1-10.
[15]. S. Ganesh, M. Jayaraj, V. Kalyan, and G. Aghila, "Ontology-based web crawler," in Proc. ITCC: Coding Comput., Las Vegas, NV, 2004, pp. 337-341.
[16]. M. Halkidi, B. Nguyen, I. Varlamis, and M. Vazirgiannis, "THESUS: Organizing web document collections based on link semantics," VLDB J., vol. 12, no. 4, pp. 320-332, Nov. 2003.
[17]. H. Dong, F. K. Hussain, and E. Chang, "State of the art in metadata abstraction crawlers," in Proc. IEEE ICIT, Chengdu, China, 2008, pp. 1-6.
[18]. E. Francesconi and G. Peruginelli, "Searching and retrieving legal literature through automated semantic indexing," in Proc. ICAIL, Stanford, CA, pp. 131-138, 2007.
[19]. C. L. Giles, Y. Petinot, P. B. Teregowda, H. Han, S. Lawrence, A. Rangaswamy, and N. Pal, "eBizSearch: A niche search engine for e-business," in Proc. SIGIR, Toronto, ON, Canada, pp. 413-414, 2003.
[20]. L. T. Su, "The relevance of recall and precision in user evaluation", J. Amer. Soc. Inf. Sci. Technol., vol. 45, no. 3, pp. 207-217, Apr. 1999.
