0% found this document useful (0 votes)

12 views

Fake Jobs Code

hmgvhjg

Uploaded by

Sushant K

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views

Fake Jobs Code

hmgvhjg

Uploaded by

Sushant K

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

11/11/24, 2:27 PM ChatGPT

To help you with this dataset, let's start by examining the contents of the file you uploaded and then
design a machine learning model to process it.

I'll take a look at the structure of the file and give you a basic code framework for a machine learning
model based on its contents. Let's check the first few rows and columns of the dataset.

The dataset has the following columns:

job_id: Unique identifier for each job listing

title: Job title

location: Job location

department: Department of the job posting

salary_range: Salary range offered

company_profile: Description of the company

description: Job description

requirements: Job requirements

benefits: Benefits offered

telecommuting: Indicates if telecommuting is allowed (0 = No, 1 = Yes)

has_company_logo: Indicates if the company logo is provided (0 = No, 1 = Yes)

has_questions: Indicates if additional questions are required (0 = No, 1 = Yes)

employment_type: Type of employment (e.g., Full-time, Part-time)

required_experience: Required level of experience

required_education: Required level of education

industry: Industry sector

function: Job function

fraudulent: Target variable indicating if a job posting is fraudulent (0 = Not fraudulent, 1 =

Fraudulent)

Code Outline for Machine Learning Model to Detect Fraudulent Job Postings
We'll start by preprocessing the text fields, encoding categorical variables, and using a classification
model to detect fraudulent job postings.

Here's a code snippet to get you started with preprocessing and training a basic model in Python:
https://chatgpt.com/c/6731baf4-843c-800b-8ea6-15b5db4b5679 1/3
11/11/24, 2:27 PM ChatGPT

python Copy code

import pandas as pd from sklearn.model_selection import train_test_split from

sklearn.feature_extraction.text import TfidfVectorizer from sklearn.preprocessing import
LabelEncoder from sklearn.ensemble import RandomForestClassifier from sklearn.metrics
import classification_report, accuracy_score from sklearn.pipeline import Pipeline from
sklearn.compose import ColumnTransformer from sklearn.impute import SimpleImputer from
sklearn.preprocessing import StandardScaler # Load the dataset data =
pd.read_csv('fake_job_postings.csv') # Drop unnecessary columns and fill missing values
in text fields data['description'] = data['description'].fillna('') data['requirements']
= data['requirements'].fillna('') data['benefits'] = data['benefits'].fillna('') #
Combine relevant text fields for text processing data['text'] = data['title'] + ' ' +
data['company_profile'] + ' ' + data['description'] + ' ' + data['requirements'] + ' ' +
data['benefits'] # Encode the target variable X = data.drop(columns=['fraudulent',
'job_id', 'title', 'company_profile', 'description', 'requirements', 'benefits']) y =
data['fraudulent'] # Train-test split X_train, X_test, y_train, y_test =
train_test_split(X, y, test_size=0.2, random_state=42) # Text and numerical processing
pipelines text_transformer = Pipeline(steps=[ ('tfidf',
TfidfVectorizer(max_features=1000)) ]) # Preprocessing for numerical and binary columns
numerical_transformer = Pipeline(steps=[ ('imputer', SimpleImputer(strategy='median')),
('scaler', StandardScaler()) ]) # Column transformer preprocessor = ColumnTransformer(
transformers=[ ('text', text_transformer, 'text'), ('num', numerical_transformer,
['telecommuting', 'has_company_logo', 'has_questions']) ]) # Full pipeline with
classifier model = Pipeline(steps=[ ('preprocessor', preprocessor), ('classifier',
RandomForestClassifier(random_state=42)) ]) # Train the model model.fit(X_train,
y_train) # Predict and evaluate y_pred = model.predict(X_test) print("Accuracy:",
accuracy_score(y_test, y_pred)) print("Classification Report:\n",
classification_report(y_test, y_pred))

Explanation of the Code:

1. Data Preprocessing:

Missing values in the text fields are filled with empty strings.

Relevant text fields are combined into a single field for vectorization.

2. Pipeline Setup:

A TfidfVectorizer is used to convert text into numerical format.

A ColumnTransformer handles different types of data (text, binary, and numerical).

3. Model Training:

We use a RandomForestClassifier to detect fraudulent job postings.

After training, predictions are evaluated on accuracy and other metrics.

https://chatgpt.com/c/6731baf4-843c-800b-8ea6-15b5db4b5679 2/3
11/11/24, 2:27 PM ChatGPT

This should help you get started with running the model. Let me know if you need further
customization or help setting it up in your environment!

https://chatgpt.com/c/6731baf4-843c-800b-8ea6-15b5db4b5679 3/3

Functional Design Principles Patterns and Practices 1nbsped 0138176396 9780138176396
No ratings yet
Functional Design Principles Patterns and Practices 1nbsped 0138176396 9780138176396
376 pages
Fake Job Post Detection Using Machine Learning
100% (1)
Fake Job Post Detection Using Machine Learning
24 pages
Sample IEEE Article Ready Format
No ratings yet
Sample IEEE Article Ready Format
5 pages
Sample IEEE Article Ready Format
No ratings yet
Sample IEEE Article Ready Format
5 pages
IEEE Conference Template 9
No ratings yet
IEEE Conference Template 9
6 pages
Fake Job Entry Detectionnn
No ratings yet
Fake Job Entry Detectionnn
25 pages
Project Report: Fake Job Prediction
No ratings yet
Project Report: Fake Job Prediction
3 pages
Predicting Fraudulant Job Ads With Machine Learning
No ratings yet
Predicting Fraudulant Job Ads With Machine Learning
3 pages
Fake Job Post Prediction: Supervisor: I.Lakshmi Manikyamba Ass0Ciate Professor-Cse
No ratings yet
Fake Job Post Prediction: Supervisor: I.Lakshmi Manikyamba Ass0Ciate Professor-Cse
10 pages
Aifb Lab Manual Exp 6 - Aids
No ratings yet
Aifb Lab Manual Exp 6 - Aids
3 pages
fakejobdett
No ratings yet
fakejobdett
9 pages
Project Viva
No ratings yet
Project Viva
4 pages
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Litrature - Survey - Keer
No ratings yet
Litrature - Survey - Keer
11 pages
ML Report Fake News Detection
No ratings yet
ML Report Fake News Detection
15 pages
Dnyaneshwar Ds
No ratings yet
Dnyaneshwar Ds
2 pages
Fake Job Detection System
No ratings yet
Fake Job Detection System
7 pages
Summer Intern
No ratings yet
Summer Intern
34 pages
Fin Ijprems1680687249
No ratings yet
Fin Ijprems1680687249
6 pages
Fake job detection using ML abstract
No ratings yet
Fake job detection using ML abstract
3 pages
ML Final
No ratings yet
ML Final
34 pages
Fake Job Prediction
No ratings yet
Fake Job Prediction
23 pages
Assignment 2: Hive
No ratings yet
Assignment 2: Hive
11 pages
FAKE JOB POST PREDICTION USING ML
No ratings yet
FAKE JOB POST PREDICTION USING ML
7 pages
AI and DS Final Document For Phase 5
No ratings yet
AI and DS Final Document For Phase 5
9 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Fake Job Detection
No ratings yet
Fake Job Detection
2 pages
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Backtrader Essentials: Building Successful Strategies with Python
From Everand
Backtrader Essentials: Building Successful Strategies with Python
Ali AZARY
No ratings yet
"C Programming for Beginners: A Step-by-Step Guide"
From Everand
"C Programming for Beginners: A Step-by-Step Guide"
Lov kush
No ratings yet
20011f0015 Akshay PRC2 New
No ratings yet
20011f0015 Akshay PRC2 New
15 pages
sibi 5
No ratings yet
sibi 5
27 pages
Phase 5
No ratings yet
Phase 5
10 pages
Python for Data Science: Data Science Mastery by Nikhil Khan, #1
From Everand
Python for Data Science: Data Science Mastery by Nikhil Khan, #1
Nikhil Khan
No ratings yet
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
César Pérez López
No ratings yet
20011f0015 Akshay PRC3
No ratings yet
20011f0015 Akshay PRC3
18 pages
Practice Questions for UiPath Certified RPA Associate Case Based
From Everand
Practice Questions for UiPath Certified RPA Associate Case Based
Exam OG
No ratings yet
hybridmodel with cnn modifications
No ratings yet
hybridmodel with cnn modifications
5 pages
Headline Detecting Fake News With M
No ratings yet
Headline Detecting Fake News With M
3 pages
Mohit Singh Resume
No ratings yet
Mohit Singh Resume
1 page
Task Details
No ratings yet
Task Details
3 pages
Task Details
No ratings yet
Task Details
3 pages
Task Details
No ratings yet
Task Details
3 pages
Internship Task: Data Science & Machine Learning
No ratings yet
Internship Task: Data Science & Machine Learning
3 pages
Task Details
No ratings yet
Task Details
3 pages
Predicting Personal Loan Approval Using Machine Learning Handbook
No ratings yet
Predicting Personal Loan Approval Using Machine Learning Handbook
31 pages
Fin Irjmets1668589338
No ratings yet
Fin Irjmets1668589338
6 pages
unit 1 ml pdf
No ratings yet
unit 1 ml pdf
19 pages
final-way
No ratings yet
final-way
15 pages
ML Summer Training
No ratings yet
ML Summer Training
20 pages
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
From Everand
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
César Pérez López
No ratings yet
Fake Online Job Recruitment[1] (5)
100% (1)
Fake Online Job Recruitment[1] (5)
13 pages
Module 3.4 Classification Models, Case Study
No ratings yet
Module 3.4 Classification Models, Case Study
12 pages
How To Use NLP in Python A Practical Step-by-Step ExampleTo Find Out The In-Demand Skills For Data SC
No ratings yet
How To Use NLP in Python A Practical Step-by-Step ExampleTo Find Out The In-Demand Skills For Data SC
12 pages
1_Swati Madhukar Taur
No ratings yet
1_Swati Madhukar Taur
2 pages
Microsoft Visual Basic Interview Questions: Microsoft VB Certification Review
From Everand
Microsoft Visual Basic Interview Questions: Microsoft VB Certification Review
Equity Press
No ratings yet
Essential n8n Playbook
From Everand
Essential n8n Playbook
Leandro Calado
No ratings yet
A Comparative Study On Fake Job Post Prediction Using Different Data Mining Techniques
100% (1)
A Comparative Study On Fake Job Post Prediction Using Different Data Mining Techniques
5 pages
Jupyter Lab
No ratings yet
Jupyter Lab
42 pages
Bringing Images to Life: Exploring DALL-E with ChatGPT
From Everand
Bringing Images to Life: Exploring DALL-E with ChatGPT
Aura-Elena Turcu
No ratings yet
OSC cie-3
No ratings yet
OSC cie-3
16 pages
Structure
No ratings yet
Structure
3 pages
Mod-3(Diagarams)
No ratings yet
Mod-3(Diagarams)
16 pages
Orbital Distance Points
No ratings yet
Orbital Distance Points
1 page
Name1 m1c1 m1c2 Merged
No ratings yet
Name1 m1c1 m1c2 Merged
233 pages
Practice
No ratings yet
Practice
45 pages
BDA Assignment 2 Questions
No ratings yet
BDA Assignment 2 Questions
1 page
Top 5 Free AI ML Deep Learning Projects For KSCST 2025
No ratings yet
Top 5 Free AI ML Deep Learning Projects For KSCST 2025
2 pages
7th Cssyll
No ratings yet
7th Cssyll
49 pages
Python p10
No ratings yet
Python p10
4 pages
Os 1
No ratings yet
Os 1
17 pages
21 MAT41 Set 1
No ratings yet
21 MAT41 Set 1
8 pages
Lesson 5 Software
No ratings yet
Lesson 5 Software
18 pages
Document 1368382.1
No ratings yet
Document 1368382.1
6 pages
Naveed Sbtech
No ratings yet
Naveed Sbtech
7 pages
Unit 1
No ratings yet
Unit 1
37 pages
Bcs402 Simp
No ratings yet
Bcs402 Simp
1 page
ZP2-F Series Fire Alarm Control Panel Operation Manual: P/N 501-405203-2-20 - REV 2.0 - ISS 03MAR11
No ratings yet
ZP2-F Series Fire Alarm Control Panel Operation Manual: P/N 501-405203-2-20 - REV 2.0 - ISS 03MAR11
32 pages
Practical Assignment #1
No ratings yet
Practical Assignment #1
4 pages
Aws 101 Presentation Deck August 2014 1
No ratings yet
Aws 101 Presentation Deck August 2014 1
47 pages
H616 Datasheet V1.0 Cleaned
No ratings yet
H616 Datasheet V1.0 Cleaned
67 pages
00-Welcome-CSCI-GA.2820-001
No ratings yet
00-Welcome-CSCI-GA.2820-001
42 pages
Compiler Design
No ratings yet
Compiler Design
20 pages
Utility Services Deployment Guide
No ratings yet
Utility Services Deployment Guide
60 pages
BSNL Jto Syllabus
No ratings yet
BSNL Jto Syllabus
3 pages
Mod07 - Technical Sales AOS-CX in The Campus
No ratings yet
Mod07 - Technical Sales AOS-CX in The Campus
43 pages
Abhay's Resume
No ratings yet
Abhay's Resume
2 pages
FRABA - Absolute - Encoder / PLC - 1 (CPU 314C-2 PN/DP) / Program Blocks
No ratings yet
FRABA - Absolute - Encoder / PLC - 1 (CPU 314C-2 PN/DP) / Program Blocks
3 pages
0-Aicte Fees (2019-2020) Final
No ratings yet
0-Aicte Fees (2019-2020) Final
39 pages
CylanceON-PREM Admin Guide
No ratings yet
CylanceON-PREM Admin Guide
117 pages
2021-2022 Time Table EVEN
No ratings yet
2021-2022 Time Table EVEN
4 pages
Packet Tracer
No ratings yet
Packet Tracer
7 pages
Worksheet 3 Quarter 3 Robotics and Electronics
No ratings yet
Worksheet 3 Quarter 3 Robotics and Electronics
7 pages
E182282-1678472976368-229532-Networking Assignment 1
No ratings yet
E182282-1678472976368-229532-Networking Assignment 1
19 pages
Demo Ms Word 2021-08-31
No ratings yet
Demo Ms Word 2021-08-31
7 pages
DX Diag 1
No ratings yet
DX Diag 1
42 pages
SM100
No ratings yet
SM100
106 pages
Got815 834
No ratings yet
Got815 834
2 pages
Chapter 9: Turing Machines
No ratings yet
Chapter 9: Turing Machines
14 pages
Datasheet-N300 Draft
No ratings yet
Datasheet-N300 Draft
9 pages
VB6.0 Notes
No ratings yet
VB6.0 Notes
120 pages