0% found this document useful (0 votes)

31 views

A Customized Hiring Process: Problem Statement

This document describes a customized hiring process system that uses machine learning algorithms like KNN and SVM to classify candidates and resumes. It analyzes candidates' test answers to recommend suitable companies and also classifies resumes by domain for hiring purposes. The system architecture includes modules for registration, testing, analysis, classification, recommendations and resume management. It aims to customize the hiring process for different clients and provide hybrid recommendations based on multiple criteria.

Uploaded by

prasanna

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views

A Customized Hiring Process: Problem Statement

Uploaded by

prasanna

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

A customized hiring process

INTRODUCTION

There are lot of companies which are available in the market and there are many
training sites which help the candidates both fresher’s as well as experienced in order to
prepare the people and help them find a suitable role in the industry. In the same way the
company might have screened dozens of applicants, vetted a select few through multiple
stages of your hiring process, and now you’re down to the final two candidates. First there is a
candidate. Interviewing her is like playing a great game of tennis. You serve the question and
she smashes it right back with a well-crafted answer. At times your conversation is like
the perfect rally. You cannot fault her game. At the same time there is one more candidate On
paper she looks great. But she is stumbling, struggling to find her feet. She is just not giving
you any kind of game. You think she can do it, but she is not convincing you.

Problem Statement
As the new technologies are evolving day by day, the human resourcing is facing peculiar
challenges in meeting the requirements from client to client. The same set of resumes for a same
JD doesn’t work for all the clients. As every organization carries a different point about a resume
while reading through the resume. Barely matching skills and experience is no more important
alone for the serious organizations. For example, some companies consider the Domain expertise
but some other gives more importance to the number of skills and total yea rs of professional
experience. Human Resource (HR) agencies use various head hunting tools and online search
methods. These search methods connected with the database of millions of resumes.

There is no portal as such for the candidates to find out based on taking certain kind of
tests what kind of company suits them. Hence an effort is made in this project to judge the
candidate capabilities and find the cluster for the candidate based on the answer analysis using
KNN classifier.

METHODOLOGY

[Type text] Page 1

A customized hiring process

Angular/ Ext JS View

with JSP

TOMCAT Web Container

Data Layer using MySQL

Middle Ware –
Controlling layer

Delegate Layer

Registration Service Login Service Question Creation Service

Prepare Company List for

K NN Classification Cluster
Test Analysis

Resume Upload

Candidate Link Recommendations

Preparation Service Classification and Search of
Resume

Fig: System Architecture

Angular/Ext JS View With JSP

[Type text] Page 2

A customized hiring process

This module is responsible for generation of views for the front end using angular and ext
js framework along with java server pages.

TOMCAT Web Container

There are many servers available in the market which is responsible for handling the web
requests. Most of the other servers are heavy weight and also are commercial in nature. Here we
make use of open source and light weight tomcat server.

Middle Ware – Controlling layer

This module is responsible for handling the web request and forwarding it to the
authentication layer. This also performs the basic validations like empty checks and regex
validations. If any validation fails then response is send to the front end otherwise the request is
forwarded to the authentication layer and respective services.

Delegate Layer

This layer is used to call the respective service in order to perform a specific task.

Registration Service

This module is responsible for allowing the user to register into the application by
providing details like Name, Password, Confirm Password, Email Id, Gender, Phone No, City,
State and Country

This module is responsible for allowing the user in order to provide the username and
password and login into the application either as an administrator or as a user.

Question Creation Service

This module is used by an administrator in order to create a set of questions and each
question will have the following information.

1) Type, 2) Question Description, 3) Answer1, 4) Answer2, 5) Answer3, 6) Answer 4

7) Rating1, 8) Rating2, 9) Rating 3 , 10) Rating 4 and 11) QuestionID

Test Analysis

[Type text] Page 3

A customized hiring process

This module is responsible for providing questions to the user which will contain
aptitude, technical and general questions and then perform analysis for all the answers and then
generate a matrix which can act as an input to the k nn classification algorithm

K NN Classification

This module is responsible for taking the answers total rating and then performing the
count of nearest neighbors across the 3 clusters like Norm Package, Medium Package and High
Package and then predict this candidate is suitable for which kind of package.

Prepare Company List for Cluster

This module is responsible for creating company information like company name,
company url, company desc along with the cluster (Norm, Medium or High) so that after the
classification is done by Knn the list of companies can be provided based on the cluster.

Candidate Preparation Link

This module is responsible for providing the links for the candidates in order to prepare
for the aptitude, technical and general questions. Each Preparation Item will contain Name, Link
and Category.

Recommendations Service

This module will provide the recommendations of which companies are most suitable for
the candidate.

Resume Upload

The candidate will be able to upload the resume

Data Cleaning of Resumes

The Data Cleaning algorithm is responsible for removal of stop words. Each of resumes
are cleaned by removing the stop words from reviews. These are the set of words which do not
have any specific meaning. The data mining forum has defined set of keywords which do not
have any meaning like a, able, about, across, after, all, almost, also, am, among, an etc

Tokenization of Resumes

[Type text] Page 4

A customized hiring process

Tokenization is a process of converting the clean data into a set of words known as
tokens
Frequency Computation of Resumes

This is a process in which the frequency computation is performed. For each of the
th th
reviews the frequency is computed. Frequency is number of times a i token appears in j .
Resume.
TF-IDF Computation of Resumes

This module is used to compute the Inverse document frequency based on the number of
resumes and then frequency of the resume.

Classification of Domain for Resume

This module is responsible for training the support vector machine based on the test data
set and then performs the attributes frequency. Find appropriate kernel and then classify the
domain to which the resume mostly belongs to. The module also computes the distance and then
classifies the domain to which the resumes belong to.
Ranking of Resumes

The entire query is divided into tokens and then frequency of those tokens across the
various resumes is found and then finally the resumes are ranked based on descending order of
the resume.

Hybrid Recommendations based on Association Rule Mining

This module is to combine multiple criteria of the resume and then rank the best resumes
based on the requirements of multi attribute searches by doing intersection of the set of various
algorithms.

[Type text] Page 5

A customized hiring process

Objectives

1. The first objective is to perform the classification of candidates using KNN machine
learning algorithm for various companies- HIGH, MEDUIM and LOW Package.
2. The second objective is to perform the recommendations of list of companies to the
candidate based on the answer analysis
3. The third objective is to classify the resume into testing and development profiles using
SVM
4. The fourth objective is to provide the HR the capability of ranking the resumes based on
specific criteria keywords and then rank the resume that best suits the requirement based
on modified feature vector

Hardware Requirements

Sl No Parameter Description
1 RAM 4GB - 8GB
2 Hard Disk 500GB – 1TB

[Type text] Page 6

A customized hiring process

Software Requirements

Sl No Parameter Name Parameter Value

1 Development Language JAVA
2 Java Development Kit Version Jdk 1.6
3 Java Run Time Environment Jre 6
4 Database for Routing Tables Backend MySQL
5 Database Front End for Routing Tables Heildi SQL
7 Development Tool Eclipse
8 Sever Type Web Server
9 Web Server Tomcat 8.0
11 Framework Used Spring Framework
12 View Technology Used Java Server Pages
13 Designing Cascading Style Sheets

End User requirements

ReqID Requirement Name Requirement Description

1 Registration This module is responsible for registration of the user
2 Login This module is responsible for performing the login
functionality and then obtain either Admin or Customer
3 Question Creation This module is responsible for creating the questions
4 Test Analysis This module is responsible for performing the analysis of
the test
5 Company Information This module is responsible for saving of the company
information like company name, company url and company

[Type text] Page 7

A customized hiring process

image url for each of the 3 clusters

6 Training Data This is sample data set which will have the company
information for aptitude, general and technical with rating
and resultant companies
7 Prepare Link This is used to provide the preparation links for helping the
candidate in the preparation
8 Resume Upload Used to storage of the resume
9 Data Cleaning Used for removal of stopwords from the resume
10 Tokenzation Used for converting statements in the resume into set of
words
11 Frequency Used for removal of redundancy
Computation
12 TF-IDF Responsible for computing important keywords in resume
13 Ranking This is used to rank resumes by HR
14 Classification This is responsible for classification of resumes into QA
and Development

References

[1] Gregory A. Wilkin ; Xiuzhen Huang,"K-Means Clustering Algorithms: Implementation and

Comparison", Second International Multi-Symposiums on Computer and Computational
Sciences (IMSCCS 2007)

[2] Shi Na ; Liu Xumin ; Guan Yong, "Research on k-means Clustering Algorithm: An Improved
k-means Clustering Algorithm", 2010 Third International Symposium on Intelligent Information
Technology and Security Informatics, 22 April 2010

[Type text] Page 8

A customized hiring process

[3] Jie Chen,1 Chunxia Zhang,2 and Zhendong Niu,"A Two-Step Resume Information Extraction
Algorithm", Received 16 August 2017; Revised 26 February 2018; Accepted 26 March 2018;
Published 8 May 2018

[4] Thomas Schmitt, Philippe Caillou, and Michele Sebag,"Matching Jobs and Resumes: a Deep
Collaborative Filtering Task",EPiC Series in Computing

[5] Tsung-Hsien Chiang, Hung-Yi Lo,Shou-De Lin,"A Ranking-based KNN Approach for
Multi-Label Classification",Graduate Institute of Computer Science and Information Engineering

National Taiwan University

[6] Junjie Wu, Advances in K-means Clustering, Springer-Verlag Berlin Heidelberg, 2012.
[7] Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman, Mining of Massive Datasets, Stanford
Infolab, 2014.
[8] Michael Steinbach, Vipin Kumar, Pang-Ning Tan, Introduction to Data Mining, Pearson
Publications, 2006.
[9] Yanchang Zhao, R and Data Mining: Examples and Case Studies, 2013.

[Type text] Page 9

OpenScape Business V3 Security Checklist Issue 5
No ratings yet
OpenScape Business V3 Security Checklist Issue 5
167 pages
STP 9000
No ratings yet
STP 9000
4 pages
C202 Planning Course Manual Sept 2011 PDF
No ratings yet
C202 Planning Course Manual Sept 2011 PDF
135 pages
Online Examination System
No ratings yet
Online Examination System
84 pages
Santosh Palivela (React & .Net Developer)
No ratings yet
Santosh Palivela (React & .Net Developer)
4 pages
Profile Summary:: Frameworks
No ratings yet
Profile Summary:: Frameworks
4 pages
Job Miller
No ratings yet
Job Miller
15 pages
Sara Technologies: Suresh Kanna.S
No ratings yet
Sara Technologies: Suresh Kanna.S
8 pages
KNKN
No ratings yet
KNKN
6 pages
Naresh Resume
No ratings yet
Naresh Resume
8 pages
PramodhKumar_Informatica
No ratings yet
PramodhKumar_Informatica
7 pages
Career Objective: Worked On SQL Functions, RDBMS Concepts Like Constraints
No ratings yet
Career Objective: Worked On SQL Functions, RDBMS Concepts Like Constraints
4 pages
Kshay UPE: Check Points, Synchronizations
No ratings yet
Kshay UPE: Check Points, Synchronizations
5 pages
Company Name: Position: Company Location: Experience: Shared By: No of Rounds: Updated On: My Personal Experience
No ratings yet
Company Name: Position: Company Location: Experience: Shared By: No of Rounds: Updated On: My Personal Experience
6 pages
Santosh Kumar
No ratings yet
Santosh Kumar
4 pages
Software Engineering Concepts
No ratings yet
Software Engineering Concepts
29 pages
Java 3+ Exp Satyam
No ratings yet
Java 3+ Exp Satyam
5 pages
Ashok 13yrs Banglore Functionl Testing
No ratings yet
Ashok 13yrs Banglore Functionl Testing
6 pages
Review-I: Internal Guide'S Specimen External Guide'S Specimen
No ratings yet
Review-I: Internal Guide'S Specimen External Guide'S Specimen
20 pages
Oracle Enterprise Data Quality 12C Contents Online
No ratings yet
Oracle Enterprise Data Quality 12C Contents Online
5 pages
TejpalSingh (7 0)
No ratings yet
TejpalSingh (7 0)
6 pages
Adeptia
No ratings yet
Adeptia
3 pages
Naukri ThorthiNaveen (5y 6m)
No ratings yet
Naukri ThorthiNaveen (5y 6m)
5 pages
Naga Sridevi: Email: Phone: 9032314870
No ratings yet
Naga Sridevi: Email: Phone: 9032314870
3 pages
Bhaskar_CV
No ratings yet
Bhaskar_CV
5 pages
Mitesh Agrawal - 2 Yrs Testing Experience
No ratings yet
Mitesh Agrawal - 2 Yrs Testing Experience
3 pages
Resume
No ratings yet
Resume
5 pages
Saladhivenkatesh E-Mail:: Web Services Testing by Using Soup Ui
No ratings yet
Saladhivenkatesh E-Mail:: Web Services Testing by Using Soup Ui
5 pages
Learning Dynamics NAV Patterns: Create solutions that are easy to maintain, are quick to upgrade, and follow proven concepts and design
From Everand
Learning Dynamics NAV Patterns: Create solutions that are easy to maintain, are quick to upgrade, and follow proven concepts and design
Marije Brummel
No ratings yet
Online Tax Management System Project Report
No ratings yet
Online Tax Management System Project Report
174 pages
Swarupa C
No ratings yet
Swarupa C
4 pages
Siva Resume
No ratings yet
Siva Resume
6 pages
NAGA
No ratings yet
NAGA
6 pages
ResumeRecomendationSystemThrough AI
No ratings yet
ResumeRecomendationSystemThrough AI
33 pages
Srikanth 154797
No ratings yet
Srikanth 154797
6 pages
ISG Mat
No ratings yet
ISG Mat
103 pages
(INV0006) Copy Inventory Organization - Simplifying Oracle E Business Suite
0% (1)
(INV0006) Copy Inventory Organization - Simplifying Oracle E Business Suite
3 pages
Java interview questions and answers on code quality
No ratings yet
Java interview questions and answers on code quality
26 pages
Testing Resume 3.4 Years
No ratings yet
Testing Resume 3.4 Years
3 pages
Vishal Jadhav - Resume..54-Converted..3
No ratings yet
Vishal Jadhav - Resume..54-Converted..3
3 pages
Student Management System
No ratings yet
Student Management System
16 pages
MIS604 Assessment 2 Brief ELINK-converted Elink
No ratings yet
MIS604 Assessment 2 Brief ELINK-converted Elink
8 pages
Varma Automation CV
No ratings yet
Varma Automation CV
5 pages
ERP Implementation Fundamentals: Richard Byrom Oracle Consultant, Speaker and Author
No ratings yet
ERP Implementation Fundamentals: Richard Byrom Oracle Consultant, Speaker and Author
23 pages
Employment Database Management System2
No ratings yet
Employment Database Management System2
22 pages
Testyou Documentation PHP
No ratings yet
Testyou Documentation PHP
60 pages
Advance Java 33333
No ratings yet
Advance Java 33333
15 pages
CV - Vikash - Kumar Singh - Java - Developer - 10+yrs
No ratings yet
CV - Vikash - Kumar Singh - Java - Developer - 10+yrs
8 pages
ETL - Interview Question&Answers-2
No ratings yet
ETL - Interview Question&Answers-2
51 pages
Smruti Ranjan Mohanty
No ratings yet
Smruti Ranjan Mohanty
3 pages
4 Plus Resume
No ratings yet
4 Plus Resume
7 pages
Curriculum Vitae Rahul Nilangekar
No ratings yet
Curriculum Vitae Rahul Nilangekar
5 pages
Project Overview: 1. Create User Account
No ratings yet
Project Overview: 1. Create User Account
80 pages
7-10 Years Experience
No ratings yet
7-10 Years Experience
4 pages
Rajeevranjan Business App Kuwait
No ratings yet
Rajeevranjan Business App Kuwait
8 pages
Mage SH
100% (2)
Mage SH
6 pages
02 Requirement Engineering
No ratings yet
02 Requirement Engineering
51 pages
International School of Management and Technology
No ratings yet
International School of Management and Technology
8 pages
CV - Rajesh Kumar - Oracle Weblogic Admin
No ratings yet
CV - Rajesh Kumar - Oracle Weblogic Admin
3 pages
Database Guides
No ratings yet
Database Guides
4 pages
Ravindar Reddy
No ratings yet
Ravindar Reddy
4 pages
Dasari Ramanaiah Testing CV3 1
No ratings yet
Dasari Ramanaiah Testing CV3 1
3 pages
Kishor_Kunal_Automation_3.5yrs
No ratings yet
Kishor_Kunal_Automation_3.5yrs
4 pages
Interfacing Gocator To Aurora Vision Studio
No ratings yet
Interfacing Gocator To Aurora Vision Studio
12 pages
Vuln Scan
No ratings yet
Vuln Scan
12 pages
Zalsm Excel To Internal Table
No ratings yet
Zalsm Excel To Internal Table
3 pages
Linux Kernel Labs
No ratings yet
Linux Kernel Labs
49 pages
Secure Your Kloxo Installation With Your Firewall/IPTABLES
No ratings yet
Secure Your Kloxo Installation With Your Firewall/IPTABLES
4 pages
Delta DVP Series Users Manual
No ratings yet
Delta DVP Series Users Manual
182 pages
Database Mapping in Sterling Integrator
No ratings yet
Database Mapping in Sterling Integrator
9 pages
Yokogawa Training Centre Uae
No ratings yet
Yokogawa Training Centre Uae
32 pages
SBS Log Collection For Conuncondiional Log
No ratings yet
SBS Log Collection For Conuncondiional Log
11 pages
In PHP dateTime Filter
No ratings yet
In PHP dateTime Filter
3 pages
Inspire Awards-Manak: E-Mias Manual For School Authority
No ratings yet
Inspire Awards-Manak: E-Mias Manual For School Authority
57 pages
Civil Users Guide
No ratings yet
Civil Users Guide
2,574 pages
Important Instructions To Examiners:: Q. No - Sub Q.N. Answer Marking Scheme
No ratings yet
Important Instructions To Examiners:: Q. No - Sub Q.N. Answer Marking Scheme
20 pages
Analytics All Web Site Data Acquisition Overview 20171031-20171106
No ratings yet
Analytics All Web Site Data Acquisition Overview 20171031-20171106
1 page
Lab04 - Introduction To Shell Programming
No ratings yet
Lab04 - Introduction To Shell Programming
7 pages
Resume CV Ahirwar
No ratings yet
Resume CV Ahirwar
4 pages
Hcxhash 2 Cap
No ratings yet
Hcxhash 2 Cap
32 pages
Course Presentation GoogleCloudProfessionalCloudDeveloper
100% (1)
Course Presentation GoogleCloudProfessionalCloudDeveloper
443 pages
Java Programming Chapter 3 GUI With Javafx
No ratings yet
Java Programming Chapter 3 GUI With Javafx
15 pages
Mathematica Link For LabVIEW
No ratings yet
Mathematica Link For LabVIEW
223 pages
V24 Getting Started Manual
No ratings yet
V24 Getting Started Manual
354 pages
PlugY The Survival Kit - Readme
No ratings yet
PlugY The Survival Kit - Readme
15 pages
Proen Corp
No ratings yet
Proen Corp
1 page
TensorRT Installation Guide
No ratings yet
TensorRT Installation Guide
38 pages
SAP Fiori Elements Expert Paper
100% (1)
SAP Fiori Elements Expert Paper
12 pages
Ludo VB
No ratings yet
Ludo VB
2 pages
InDesign - 20 Free Scripts PDF
No ratings yet
InDesign - 20 Free Scripts PDF
9 pages