0% found this document useful (0 votes)

124 views17 pages

SEARCH ENGINE (Synopsis) - Vivek

The document discusses the objectives and design of a search engine project created using Java. It aims to provide users information from the web in a fast and easy way by allowing keyword searches. The search engine will index web pages and their content to return relevant results. It will utilize techniques like tags, meta tags, and keyword density to determine the priority of search results. The project requirements include creating a database to store information and interfaces for user access and searching.

Uploaded by

Alok Mishra

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

124 views17 pages

SEARCH ENGINE (Synopsis) - Vivek

Uploaded by

Alok Mishra

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

SEARCH ENGINE

SYNOPSIS
OF SEARCH ENGINE USING JAVA

BATCHELOR OF COMPUTER APPLICATION

SUBMITTED BY

VIVEK PANDEY
Batch Year – 2019-22
Enrollment No. – U196073

PROJECT GUIDE
ER. DHANANJAY SINGH

Centre of Computer Education & Training

Institude of Professional Studies
University of Allahabad, Prayagraj
Uttar Pradesh 1
INTRODUCTION

The search engine concepts particularly based on the search the information on the World Wide Web.
Most engines allow you to type in a few words, and then search for occurrences of these words
in their data base. If you just type words into the "basic search" interface you get from the search engine's main page. Basically we are
trying to create a search engine named "JUST BROWSE" which is searching for your city ,basic contact and descriptive information of
your city firms and other basic need places.
Most engines have separate advanced search forms
where you can be more specific, and form complex Boolean searches. Some search tools parse HTML tags, allowing you to look for
things specifically as links, or as a title or URL without consideration of the text on the page. We here not tying to use that we just have
simply keywords to be search they are also being categorized under some areas where search can be more filtered the idea behind the
website to make the city information on web through a search engine the concept was first implemented manually through "yellow pages
book" now only the metropoltican cities have them and there online implementation is also done but here we are also going to
implement the rural areas near us to web or internet what we say .

PROBLEM DEFINITION

I have been keeping some issues of the most common search engine ranking problems, and have come up with a point of what I would
say are the “most common” search engine ranking problems. Many of these issues can easily be fixed, of course it will take time or
money (or both).
Lack of links , repetitive title tracks , too many 404 error , unclean URLs , slow web page load time .

MOTIVATION

Information is a very essential need for the peoples related to corporate, social, political, entertainment and educational sectors. It is
quite difficult to get required information’s in short time. By using internet resources we can get those information’s but it is time
consuming as we have search through various sites randomly as we can’t say about the exact place where we can get the required
information’s. The development in various aspects of computer technology has reached beyond our imagination & expectations. Every
now and then, new technologies are launched in the market to ease our daily works. This fact inspired me to develop a web search
engine which will help in finding information’s from internet very easily and in fast way. Users can access huge amount of information’s
in no time by just giving a keyword as input and within a moment all information’s related to that keyword will be indexed on the screen
according to their popularity.

2
OBJECTIVE

Search Engine project will able to provide users required information at one particular place by using the words and patterns
entered by the user during their search operation. All the information will be provided over the browser screen where users can select
appropriate link filtered by the search query. Whatever the information presented to the user can be in any form by default such as it may
be in the form of web pages, pdf file, doc file etc. The search query will provide listing of web pages as per their occurrence during
search operations.

This search mechanism will work on the concept of tags and meta tags which are used
while writing the contents under the particular web pages. If the user’s query don’t matched with the tags and meta tags then it will go
for summary section to match the given words in order to present the exact output or results. Upon going through web pages an index
file will be created where listing of pages will be done by the system and present them as per their index number. Keyword density for a
particular post under a particular web page will also help the system to set the priority for indexing the web page.

3
REQUIREMENT ANALYSIS

Software Analysis

• Phase Analysis

During the feasibility phase, first of all I tried to find out what are the requirements for my project, and come up with a solution that to
build a search engine I require a database which would contain all the information’s like images, audios and videos and an interface
through which user can access those information’s. So for making the database I require Wamp and for the interface I need PHP and
HTML pages. Thus during the Requirement analysis & specification phase I collected all the data that would be stored to the database
i.e. images, audios, videos.
At the designing phase I designed the database which contains designing of
tables and setting up of the table fields, data types, required fields, primary keys etc. next I have designed the interface that contains
designing of interface using HTML and JAVA.
During the coding and unit testing phase we divided my system design into modules and start writing codes for
those modules, after writing codes for each module, and tested each of them to check whether it is working correctly or not. During
integration and system testing phase, Integration of different modules is undertaken once the different modules have been coded and unit
tested in a planned manner. Though integrating all the modules is not done in one shot, during each integration step, previously planned
modules are added to the partially integrated system and the resultant system is tested. After all the modules have been successfully
integrated and tested, then we have carried out system testing to check whether the fully developed system conforms to its requirement
specification or not in order to ensure that the final product is error free.

4
Study Design

5
Types of Search Engine

Search engines are classified into the following three categories based on how it works.

1. Crawler based search engines

2. Human powered directories
3. Hybrid search engines
4. Other special search engines

1. Crawler Based Search Engines

All crawler based search engines use a crawler or bot or spider for crawling and indexing new content to the search database. There are
four basic steps, every crawler based search engines follow before displaying any sites in the search results.

• Crawling
• Indexing
• Calculating Relevancy
• Retrieving the Result

Most of the popular search engines are crawler based search engines and use the above technology to display search results.
Example of crawler based search engines:
• Google
• Bing
• Yahoo!
• Baidu
• Yandex
• Besides these popular search engines there are many other crawler based search engines available like DuckDuckGo, AOL and Ask.

6
2. Human Powered Directories

Human powered directories also referred as open directory system depends on human based activities for listings. Below is how the
indexing in human powered directories work:

• Site owner submits a short description of the site to the directory along with category it is to be listed.
• Submitted site is then manually reviewed and added in the appropriate category or rejected for listing.
• Keywords entered in a search box will be matched with the description of the sites. This means the changes made to the content of a
web pages are not taken into consideration as it is only the description that matters.
• A good site with good content is more likely to be reviewed for free compared to a site with poor content.
• Yahoo! Directory and DMOZ were perfect examples of human powered directories. Unfortunately, automated search engines
like Google, wiped out all those human powered directory style search engines out of the web.

3. Hybrid Search Engines

Hybrid Search Engines use both crawler based and manual indexing for listing the sites in search results. Most of the crawler based
search engines like Google basically uses crawlers as a primary mechanism and human powered directories as secondary mechanism.
For example, Google may take the description of a webpage from human powered directories and show in the search results. As human
powered directories are disappearing, hybrid types are becoming more and more crawler based search engines.
But still
there are manual filtering of search result happens to remove the copied and spammy sites. When a site is being identified for spammy
activities, the website owner needs to take corrective action and resubmit the site to search engines. The experts do manual review of the
submitted site before including it again in the search results. In this manner though the crawlers control the processes, the control is
manual to monitor and show the search results naturally.

7
4. Other Types of Search Engines

Besides the above three major types, search engines can be classified into many other categories depending upon the usage. Below are
some of the examples:
• Search engines have different types of bots for exclusively displaying images, videos, news, products and local listings. For
example, Google News page can be used to search only news from different newspapers.
• Some of the search engines like Dogpile collects meta information of the pages from other search engines and directories to display in
the search results. This type of search engines are called metasearch engines.
• Semantic search engines like Swoogle provide accurate search results on specific area by understanding the contextual meaning of the
search queries.

8
A search engine comprises four essential modules :
1. A document processor
2. A query processor
3. A search and matching function
4. A ranking capability

While users focus on "search," the search and matching function is only one of the four modules. Each of these four modules may
cause the expected or unexpected results that consumers get when they use a search engine.

9
10
DATA FLOW DIAGRAM (DFD)

fig : DFD level 0

11
Fig : DFD level 1
12
E-R DIAGRAM

13
Image to define Search Engine

14
MILESTONE

S.No. Project Activity Estimated Start Estimated End

Date Date

1. Synopsis Submission 16/09/2021 30/09/2021

2. Powerpoint Presentation 27/09/2021 28/09/2021

3. Core code 15/09/2021

4. Presentation 1 28/09/2021 28/09/2021

5. Presentation 2 28/09/2021 28/09/2021

6. Final Presentation

7. Project Submission
MEETING WITH THE SUPERVISOR

Date of Mode Comments by Signature of the

the meet the supervisor Supervisor
19/09/2021 Online Send some videos
about how to make
synopsis who don’t
know.
26/09/2021 Online Check our presentation
in online meeting and
discuss on it about the
changes .
27/09/2021 Online Instruction about the
presentation and
synopsis , what should
be compulsory to add
in presentation and
synopsis.

16
Bibliography & References

In the conclusion we can say that it will be very effective and user friendly to use. Person with minimal knowledge regarding
computer and internet can take advantage from it. Huge amount of data can be stored easily in error free manner. It displays
the popular information’s first for which we get access to important information’s in no time. Though every task is never said
to be perfect in this development field even more improvement may be possible in this system.

• www.slideshare.net
• www.ijarcce.com

All in One Dorks Making From Beginner To Expert Ebook The HQ Ever by Don 1
80% (5)
All in One Dorks Making From Beginner To Expert Ebook The HQ Ever by Don 1
29 pages
SEO Skill Assessment Fiverr Test 1
100% (1)
SEO Skill Assessment Fiverr Test 1
22 pages
Diggity SEO On Site SEO Guide v1.11 PDF
No ratings yet
Diggity SEO On Site SEO Guide v1.11 PDF
34 pages
Web Development
No ratings yet
Web Development
11 pages
Complete Guide To Social Video
No ratings yet
Complete Guide To Social Video
49 pages
Certificate
No ratings yet
Certificate
59 pages
Google Adsense Zontha Blueprint
No ratings yet
Google Adsense Zontha Blueprint
2 pages
CSC111 Introduction To Computer Science
No ratings yet
CSC111 Introduction To Computer Science
30 pages
Web Analytics, Web Mining, and Social Analytics
No ratings yet
Web Analytics, Web Mining, and Social Analytics
53 pages
Flipkart Strategic Business Management
0% (1)
Flipkart Strategic Business Management
20 pages
Company Name Business Plan: Owner'S Name Insert Address Phone
No ratings yet
Company Name Business Plan: Owner'S Name Insert Address Phone
33 pages
KAG Sacco (Without The Portal)
0% (1)
KAG Sacco (Without The Portal)
15 pages
Kathleen Livingstone Resume/CV 2019
No ratings yet
Kathleen Livingstone Resume/CV 2019
2 pages
Drawing Register Reference Manual
No ratings yet
Drawing Register Reference Manual
29 pages
Major Project PROPOSAL-BACHELOR OF ENGINEERING
No ratings yet
Major Project PROPOSAL-BACHELOR OF ENGINEERING
37 pages
Carlos Benjamin 1
No ratings yet
Carlos Benjamin 1
40 pages
Search Engine Optimization (SEO) : (Day One)
No ratings yet
Search Engine Optimization (SEO) : (Day One)
16 pages
SQL Access
No ratings yet
SQL Access
49 pages
Guide 9 - LinkedIn Traffic Rush
No ratings yet
Guide 9 - LinkedIn Traffic Rush
10 pages
Dizzying But Invisible Depth: Jean-Baptiste Queru
100% (1)
Dizzying But Invisible Depth: Jean-Baptiste Queru
40 pages
A Study On Effectiveness of Digital Marketing Services at Victory Software Solutions
No ratings yet
A Study On Effectiveness of Digital Marketing Services at Victory Software Solutions
11 pages
Project Front Pages-IWT
No ratings yet
Project Front Pages-IWT
24 pages
Caso 2 - YOB Bank PDF
No ratings yet
Caso 2 - YOB Bank PDF
10 pages
Term Paper OF Int-301: Web Programming: Topic: Search Engine
No ratings yet
Term Paper OF Int-301: Web Programming: Topic: Search Engine
18 pages
Boycheva CV
No ratings yet
Boycheva CV
2 pages
Gmit Thesis
100% (3)
Gmit Thesis
8 pages
Project Report On "Andrews Library Management System": Gandhinagar
No ratings yet
Project Report On "Andrews Library Management System": Gandhinagar
75 pages
Huge List of Business Ideas in The Philippines
No ratings yet
Huge List of Business Ideas in The Philippines
12 pages
Web Engineering Solve Updated PDF
No ratings yet
Web Engineering Solve Updated PDF
77 pages
Business Research Methods Notes
No ratings yet
Business Research Methods Notes
106 pages
WWW - Kashipara.in: A Project Report On
No ratings yet
WWW - Kashipara.in: A Project Report On
58 pages
Seminar Report
100% (4)
Seminar Report
44 pages
Kashish Chandekar MB22112 Sip PFD
No ratings yet
Kashish Chandekar MB22112 Sip PFD
27 pages
Office File Management System
No ratings yet
Office File Management System
44 pages
Impact of Digital Marketing in Start-Up
No ratings yet
Impact of Digital Marketing in Start-Up
6 pages
Final Project Report: Donate Medicine For Needy
No ratings yet
Final Project Report: Donate Medicine For Needy
44 pages
AWP Module2 PHP
No ratings yet
AWP Module2 PHP
26 pages
Summary Chapter 2
No ratings yet
Summary Chapter 2
2 pages
Smart Parking
No ratings yet
Smart Parking
39 pages
Path Visualizer: Gaurav Rana 01 Abhishek Kumar Singh 17 Adarsh Singh 18 Mentor-Mrs. Huda Khan
No ratings yet
Path Visualizer: Gaurav Rana 01 Abhishek Kumar Singh 17 Adarsh Singh 18 Mentor-Mrs. Huda Khan
18 pages
A Project Report For BCA
No ratings yet
A Project Report For BCA
89 pages
YouTube Transcript Summarizer PPT Final
100% (1)
YouTube Transcript Summarizer PPT Final
9 pages
CB 17 Black Book
No ratings yet
CB 17 Black Book
47 pages
Cloud Computing
No ratings yet
Cloud Computing
28 pages
A System To Filter Unwanted Messages From Osn User Walls
0% (1)
A System To Filter Unwanted Messages From Osn User Walls
19 pages
Daily SEO Learning
No ratings yet
Daily SEO Learning
5 pages
Text Summarization On Youtube Videos in Educational Domain
No ratings yet
Text Summarization On Youtube Videos in Educational Domain
5 pages
Project - Report
No ratings yet
Project - Report
56 pages
Detection of Fake Online Reviews Using Semi Supervised and Supervised Learning
No ratings yet
Detection of Fake Online Reviews Using Semi Supervised and Supervised Learning
4 pages
Web Development
No ratings yet
Web Development
23 pages
Data Mining of Restaurant Review Using W PDF
No ratings yet
Data Mining of Restaurant Review Using W PDF
4 pages
Final Year Project Ideas
100% (1)
Final Year Project Ideas
7 pages
HTML Advantage and Disadvantage
No ratings yet
HTML Advantage and Disadvantage
4 pages
Resume Screening Using Machine Learning
No ratings yet
Resume Screening Using Machine Learning
5 pages
Benchmark Study of Desktop Search Tools
100% (1)
Benchmark Study of Desktop Search Tools
15 pages
Mini Project Report - Format (2023-24) (AI)
No ratings yet
Mini Project Report - Format (2023-24) (AI)
17 pages
Freelancing Platform Synopsis
No ratings yet
Freelancing Platform Synopsis
9 pages
Certificate: Kamala Education Society's
No ratings yet
Certificate: Kamala Education Society's
25 pages
Search Engine Problems and Solutions
No ratings yet
Search Engine Problems and Solutions
2 pages
Movie Recommendation System Using Machine Learning
No ratings yet
Movie Recommendation System Using Machine Learning
23 pages
Miniproject Sample Report
No ratings yet
Miniproject Sample Report
15 pages
Simple Mail Service
No ratings yet
Simple Mail Service
50 pages
ROSPL Report SEO
No ratings yet
ROSPL Report SEO
18 pages
Indira Gandhi National Open University
No ratings yet
Indira Gandhi National Open University
37 pages
Summative Test Module 2
No ratings yet
Summative Test Module 2
1 page
Project Introduction: Chapter-1
No ratings yet
Project Introduction: Chapter-1
29 pages
Tmi 4013 Revision V 5
No ratings yet
Tmi 4013 Revision V 5
6 pages
CSMS Project Report
No ratings yet
CSMS Project Report
48 pages
Crime Reporting System
No ratings yet
Crime Reporting System
30 pages
Develop Static Pages (Using Only HTML) of An Online Book Store. Should Consist The Following Pages
No ratings yet
Develop Static Pages (Using Only HTML) of An Online Book Store. Should Consist The Following Pages
96 pages
Manual WT
No ratings yet
Manual WT
6 pages
Web Lab Report 2
No ratings yet
Web Lab Report 2
6 pages
Mini Project Report - Format (2023-24) (AI)
No ratings yet
Mini Project Report - Format (2023-24) (AI)
20 pages
H.V.P.M's College of Engineering and Technology, Amravati
No ratings yet
H.V.P.M's College of Engineering and Technology, Amravati
23 pages
Synopsis Main
No ratings yet
Synopsis Main
9 pages
PHP - Project Titles
No ratings yet
PHP - Project Titles
7 pages
(Source Code Repository) : Project
No ratings yet
(Source Code Repository) : Project
10 pages
Web Development and Designing: Summer Internship Report
No ratings yet
Web Development and Designing: Summer Internship Report
24 pages
College Website Creation
No ratings yet
College Website Creation
36 pages
Diet Management Systeam
No ratings yet
Diet Management Systeam
5 pages
Minor Project Synopsis Format CSE PPIMT fINAL - G2 Group
No ratings yet
Minor Project Synopsis Format CSE PPIMT fINAL - G2 Group
12 pages
BBA Digital Marketing
No ratings yet
BBA Digital Marketing
34 pages
Emergency Ambulance Hiring MS
No ratings yet
Emergency Ambulance Hiring MS
42 pages
Synopsis On Website Builder
No ratings yet
Synopsis On Website Builder
7 pages
Rukaiya Munim CV
No ratings yet
Rukaiya Munim CV
3 pages
AI Agents-Top 25 Use Cases Transforming Industries
No ratings yet
AI Agents-Top 25 Use Cases Transforming Industries
38 pages
Major Project of Ai Mock REPORT
No ratings yet
Major Project of Ai Mock REPORT
47 pages
Digital Marketing Internship Report
No ratings yet
Digital Marketing Internship Report
16 pages
Trackpad Pro Ver. 5.0 Class 6
From Everand
Trackpad Pro Ver. 5.0 Class 6
Nidhi Arora
No ratings yet
Touchpad Plus Ver. 1.1 Class 7
From Everand
Touchpad Plus Ver. 1.1 Class 7
Nisha Batra
No ratings yet