NLP Lab Programs

The document outlines five NLP lab programs using the NLTK library: text tokenization, extracting the sentences of a document, tokenization with stop words as delimiters, removing stop words and punctuation, and stemming. Each program includes an example code snippet and instructions for downloading the necessary NLTK data.


1. Tokenize a text
from nltk.tokenize import word_tokenize, sent_tokenize
import nltk

nltk.download('punkt')  # Download tokenizer data (newer NLTK versions may also need 'punkt_tab')

# Example text
text = "NLP makes machines understand language. Tokenization is the first step."

# Sentence Tokenization
print("Sentences:", sent_tokenize(text))

# Word Tokenization
print("Words:", word_tokenize(text))

Output:
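Running this should print something close to the following (the exact lists come from NLTK's Punkt sentence tokenizer and word tokenizer):

Sentences: ['NLP makes machines understand language.', 'Tokenization is the first step.']
Words: ['NLP', 'makes', 'machines', 'understand', 'language', '.', 'Tokenization', 'is', 'the', 'first', 'step', '.']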

2. Extract the sentences of a text document


from nltk.tokenize import sent_tokenize
import nltk

nltk.download('punkt')  # Download tokenizer data

# Read the text from a file
file_path = "sample.txt"  # placeholder name; replace with your file path
with open(file_path, 'r') as file:
    text = file.read()

# Sentence Tokenization
sentences = sent_tokenize(text)

# Display the sentences
print("Sentences in the document:")
for i, sentence in enumerate(sentences, 1):
    print(f"{i}: {sentence}")
Save a text file (here called sample.txt as a placeholder) in the Jupyter notebook's working directory before running the program.
Output:
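To make the program self-contained, the sample file can be created first. This is a minimal sketch; the filename sample.txt and its three sentences are assumptions, not part of the original program:

# Create a small sample file (hypothetical name and content)
with open("sample.txt", "w") as f:
    f.write("NLP is fun. It has many applications! Do you agree?")

With that content, the program should print approximately:

Sentences in the document:
1: NLP is fun.
2: It has many applications!
3: Do you agree?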

3. Tokenize text with stop words as delimiters

from nltk.tokenize import word_tokenize
from nltk.corpus import stopwords
import nltk

# Download necessary data
nltk.download('punkt')
nltk.download('stopwords')

# Example text
text = "I enjoy learning Python and coding."

# Define stop words
stop_words = set(stopwords.words('english'))

# Tokenize the text
words = word_tokenize(text)

# Filter out stop words (they act as the boundaries between kept tokens)
tokens_without_stopwords = [word for word in words if word.lower() not in stop_words]

# Output the result
print("Original Tokens:", words)
print("Tokens without Stop Words:", tokens_without_stopwords)

Output:
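With NLTK's English stop word list, this should print approximately:

Original Tokens: ['I', 'enjoy', 'learning', 'Python', 'and', 'coding', '.']
Tokens without Stop Words: ['enjoy', 'learning', 'Python', 'coding', '.']

Note that the code above filters stop words out of the token list. If the goal is to treat stop words as true delimiters, i.e. split the sentence into phrases wherever a stop word occurs, a minimal sketch using itertools.groupby (an assumption, not part of the original program) could look like this:

from itertools import groupby

# Group consecutive non-stop-word tokens into phrases,
# using the stop words as phrase boundaries
phrases = [" ".join(group)
           for is_stop, group in groupby(words, key=lambda w: w.lower() in stop_words)
           if not is_stop]
print("Phrases:", phrases)  # e.g. ['enjoy learning Python', 'coding .']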

4. Remove stop words and punctuation from a text

from nltk.tokenize import word_tokenize
from nltk.corpus import stopwords
import string
import nltk

# Download necessary data
nltk.download('punkt')
nltk.download('stopwords')

# Example text
text = "Python is great! It's simple and powerful."

# Define stop words
stop_words = set(stopwords.words('english'))

# Tokenize the text
words = word_tokenize(text)

# Remove stop words and punctuation
tokens_cleaned = [word for word in words
                  if word.lower() not in stop_words and word not in string.punctuation]

# Output the result
print("Tokens without Stop Words and Punctuation:", tokens_cleaned)

Output:
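With NLTK's English stop word list, this should print approximately:

Tokens without Stop Words and Punctuation: ['Python', 'great', "'s", 'simple', 'powerful']

The token "'s" can survive because string.punctuation only matches single-character tokens; one common alternative is to keep only tokens satisfying word.isalnum(), which would drop it as well.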

5. Perform stemming
# import these modules
from nltk.stem import PorterStemmer
from nltk.tokenize import word_tokenize

ps = PorterStemmer()

# choose some words to be stemmed
words = ["pythonprogramming", "programs", "programmer", "event", "thankyou"]

for w in words:
    print(w, " : ", ps.stem(w))

Output:
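With NLTK's PorterStemmer, the loop should print approximately:

pythonprogramming  :  pythonprogram
programs  :  program
programmer  :  programm
event  :  event
thankyou  :  thankyou

Note that a stemmer clips suffixes by rule, so results such as 'programm' are not always dictionary words, and words no rule applies to ('event', 'thankyou') pass through unchanged.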
