SEARCH ENGINE (Synopsis) - Vivek
SEARCH ENGINE (Synopsis) - Vivek
SYNOPSIS
OF SEARCH ENGINE USING JAVA
SUBMITTED BY
VIVEK PANDEY
Batch Year – 2019-22
Enrollment No. – U196073
PROJECT GUIDE
ER. DHANANJAY SINGH
The search engine concepts particularly based on the search the information on the World Wide Web.
Most engines allow you to type in a few words, and then search for occurrences of these words
in their data base. If you just type words into the "basic search" interface you get from the search engine's main page. Basically we are
trying to create a search engine named "JUST BROWSE" which is searching for your city ,basic contact and descriptive information of
your city firms and other basic need places.
Most engines have separate advanced search forms
where you can be more specific, and form complex Boolean searches. Some search tools parse HTML tags, allowing you to look for
things specifically as links, or as a title or URL without consideration of the text on the page. We here not tying to use that we just have
simply keywords to be search they are also being categorized under some areas where search can be more filtered the idea behind the
website to make the city information on web through a search engine the concept was first implemented manually through "yellow pages
book" now only the metropoltican cities have them and there online implementation is also done but here we are also going to
implement the rural areas near us to web or internet what we say .
PROBLEM DEFINITION
I have been keeping some issues of the most common search engine ranking problems, and have come up with a point of what I would
say are the “most common” search engine ranking problems. Many of these issues can easily be fixed, of course it will take time or
money (or both).
Lack of links , repetitive title tracks , too many 404 error , unclean URLs , slow web page load time .
MOTIVATION
Information is a very essential need for the peoples related to corporate, social, political, entertainment and educational sectors. It is
quite difficult to get required information’s in short time. By using internet resources we can get those information’s but it is time
consuming as we have search through various sites randomly as we can’t say about the exact place where we can get the required
information’s. The development in various aspects of computer technology has reached beyond our imagination & expectations. Every
now and then, new technologies are launched in the market to ease our daily works. This fact inspired me to develop a web search
engine which will help in finding information’s from internet very easily and in fast way. Users can access huge amount of information’s
in no time by just giving a keyword as input and within a moment all information’s related to that keyword will be indexed on the screen
according to their popularity.
2
OBJECTIVE
Search Engine project will able to provide users required information at one particular place by using the words and patterns
entered by the user during their search operation. All the information will be provided over the browser screen where users can select
appropriate link filtered by the search query. Whatever the information presented to the user can be in any form by default such as it may
be in the form of web pages, pdf file, doc file etc. The search query will provide listing of web pages as per their occurrence during
search operations.
This search mechanism will work on the concept of tags and meta tags which are used
while writing the contents under the particular web pages. If the user’s query don’t matched with the tags and meta tags then it will go
for summary section to match the given words in order to present the exact output or results. Upon going through web pages an index
file will be created where listing of pages will be done by the system and present them as per their index number. Keyword density for a
particular post under a particular web page will also help the system to set the priority for indexing the web page.
3
REQUIREMENT ANALYSIS
Software Analysis
• Phase Analysis
During the feasibility phase, first of all I tried to find out what are the requirements for my project, and come up with a solution that to
build a search engine I require a database which would contain all the information’s like images, audios and videos and an interface
through which user can access those information’s. So for making the database I require Wamp and for the interface I need PHP and
HTML pages. Thus during the Requirement analysis & specification phase I collected all the data that would be stored to the database
i.e. images, audios, videos.
At the designing phase I designed the database which contains designing of
tables and setting up of the table fields, data types, required fields, primary keys etc. next I have designed the interface that contains
designing of interface using HTML and JAVA.
During the coding and unit testing phase we divided my system design into modules and start writing codes for
those modules, after writing codes for each module, and tested each of them to check whether it is working correctly or not. During
integration and system testing phase, Integration of different modules is undertaken once the different modules have been coded and unit
tested in a planned manner. Though integrating all the modules is not done in one shot, during each integration step, previously planned
modules are added to the partially integrated system and the resultant system is tested. After all the modules have been successfully
integrated and tested, then we have carried out system testing to check whether the fully developed system conforms to its requirement
specification or not in order to ensure that the final product is error free.
4
Study Design
5
Types of Search Engine
Search engines are classified into the following three categories based on how it works.
• Crawling
• Indexing
• Calculating Relevancy
• Retrieving the Result
Most of the popular search engines are crawler based search engines and use the above technology to display search results.
Example of crawler based search engines:
• Google
• Bing
• Yahoo!
• Baidu
• Yandex
• Besides these popular search engines there are many other crawler based search engines available like DuckDuckGo, AOL and Ask.
6
2. Human Powered Directories
Human powered directories also referred as open directory system depends on human based activities for listings. Below is how the
indexing in human powered directories work:
• Site owner submits a short description of the site to the directory along with category it is to be listed.
• Submitted site is then manually reviewed and added in the appropriate category or rejected for listing.
• Keywords entered in a search box will be matched with the description of the sites. This means the changes made to the content of a
web pages are not taken into consideration as it is only the description that matters.
• A good site with good content is more likely to be reviewed for free compared to a site with poor content.
• Yahoo! Directory and DMOZ were perfect examples of human powered directories. Unfortunately, automated search engines
like Google, wiped out all those human powered directory style search engines out of the web.
Hybrid Search Engines use both crawler based and manual indexing for listing the sites in search results. Most of the crawler based
search engines like Google basically uses crawlers as a primary mechanism and human powered directories as secondary mechanism.
For example, Google may take the description of a webpage from human powered directories and show in the search results. As human
powered directories are disappearing, hybrid types are becoming more and more crawler based search engines.
But still
there are manual filtering of search result happens to remove the copied and spammy sites. When a site is being identified for spammy
activities, the website owner needs to take corrective action and resubmit the site to search engines. The experts do manual review of the
submitted site before including it again in the search results. In this manner though the crawlers control the processes, the control is
manual to monitor and show the search results naturally.
7
4. Other Types of Search Engines
Besides the above three major types, search engines can be classified into many other categories depending upon the usage. Below are
some of the examples:
• Search engines have different types of bots for exclusively displaying images, videos, news, products and local listings. For
example, Google News page can be used to search only news from different newspapers.
• Some of the search engines like Dogpile collects meta information of the pages from other search engines and directories to display in
the search results. This type of search engines are called metasearch engines.
• Semantic search engines like Swoogle provide accurate search results on specific area by understanding the contextual meaning of the
search queries.
8
A search engine comprises four essential modules :
1. A document processor
2. A query processor
3. A search and matching function
4. A ranking capability
While users focus on "search," the search and matching function is only one of the four modules. Each of these four modules may
cause the expected or unexpected results that consumers get when they use a search engine.
9
10
DATA FLOW DIAGRAM (DFD)
11
Fig : DFD level 1
12
E-R DIAGRAM
13
Image to define Search Engine
14
MILESTONE
6. Final Presentation
7. Project Submission
MEETING WITH THE SUPERVISOR
16
Bibliography & References
In the conclusion we can say that it will be very effective and user friendly to use. Person with minimal knowledge regarding
computer and internet can take advantage from it. Huge amount of data can be stored easily in error free manner. It displays
the popular information’s first for which we get access to important information’s in no time. Though every task is never said
to be perfect in this development field even more improvement may be possible in this system.
• www.slideshare.net
• www.ijarcce.com
17