0% found this document useful (0 votes)

5 views

Setting Up Packages for Speech Recognition

This document provides a step-by-step guide for setting up packages necessary for speech recognition, including activating the environment, installing audio processing libraries, and preparing for Whisper installation. It outlines specific commands for installing packages like librosa, SpeechRecognition, and Whisper AI, as well as additional setup for FFmpeg and PyTorch. Finally, it includes a verification step to ensure Whisper is correctly installed.

Uploaded by

rndattaba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

Setting Up Packages for Speech Recognition

Uploaded by

rndattaba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Setting Up Packages for

Speech Recognition

Step 1: Ensure Your Environment is Active

• Open Anaconda Prompt (or Terminal if you’re using macOS).

• Confirm that you are in the correct environment. You should see the environment name
(e.g., (speech_env)) in parentheses.

• If you’re not in the correct environment, activate it with:

conda activate speech_env

Step 2: Install Audio Processing and Visualization

Packages
1. Install librosa (for audio processing):

o Run the following command in Anaconda Prompt (or Terminal if you’re using
macOS):

pip install librosa

2. Install SpeechRecognition (for converting spoken language to text):

o Enter the following command:

pip install SpeechRecognition

o Note: Be careful with the capitalization and spacing—SpeechRecognition must

be typed as a single word.

3. Install jiwer (for evaluating speech recognition accuracy):

o Type the following command:

pip install jiwer

4. Install matplotlib (for visualizations):

o Use this command to install:

pip install matplotlib

5. Install Google Text-to-Speech (gTTS) (for converting text into speech):

o Run this command:

pip install gTTS

Step 3: Prepare for Whisper Installation

Whisper by OpenAI requires some additional setup to handle multimedia data effectively.
Follow these steps to ensure everything is in place:

1. Install PyTorch:

o Go to the PyTorch website and select the “Get Started” option.

o Under “Start locally”, configure the following:

▪ Stable version

▪ Your operating system (e.g., Windows, macOS)

▪ Package: pip

▪ Language: Python

▪ Compute platform: Select a CUDA version if you have a GPU or select

CPU.

o Copy the provided installation command and paste it into Anaconda Prompt,
then press Enter.

2. Install Chocolatey (Windows only):

o Go to the Chocolatey website and click Install.

o Choose Individual installation and copy the provided code.

o Open Windows PowerShell as an administrator, paste the code, and run it.

Note: macOS users can use Homebrew instead of Chocolatey for the following steps.

3. Install FFmpeg:

o In Anaconda Prompt (or PowerShell with Chocolatey), type:

choco install ffmpeg

o Confirm with “Y” when prompted. FFmpeg is essential for Whisper to handle
various audio formats.

Step 4: Install Whisper AI for Speech Recognition

• With the environment activated in Anaconda Prompt, install Whisper using the
following command:

pip install -U openai-whisper

• Troubleshooting: If you receive an error stating that Whisper isn’t found after
installation, you can also try running the above pip install command in Command
Prompt instead of Anaconda Prompt.

Step 5: Verify Whisper Installation

• To confirm Whisper is correctly installed, type:

pip show openai-whisper

o If installed, this command will display Whisper’s details.

You’re all set! With all packages installed, you’re ready to dive into speech-to-text exercises.

Fine-Tune Whisper For Multilingual ASR With Transformers
No ratings yet
Fine-Tune Whisper For Multilingual ASR With Transformers
24 pages
Learn Python in 10 Minutes
From Everand
Learn Python in 10 Minutes
Victor Ebai
4/5 (30)
Desktop Assistant Final
No ratings yet
Desktop Assistant Final
15 pages
How To Use Whisper AI - The Only Guide You Need
No ratings yet
How To Use Whisper AI - The Only Guide You Need
20 pages
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
No ratings yet
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
48 pages
suryanarayan 3
No ratings yet
suryanarayan 3
2 pages
Coding The Future: A Comprehensive Guide To AI Development-By Tyler P Welch - The Astral Merchant
No ratings yet
Coding The Future: A Comprehensive Guide To AI Development-By Tyler P Welch - The Astral Merchant
31 pages
Installing Whisper WebUI On Windows 10 - Web
No ratings yet
Installing Whisper WebUI On Windows 10 - Web
24 pages
Coding The Future: A Comprehensive Guide To AI Development-By Tyler Welch
No ratings yet
Coding The Future: A Comprehensive Guide To AI Development-By Tyler Welch
180 pages
Speech Understanding Content
No ratings yet
Speech Understanding Content
9 pages
Code in Voices
No ratings yet
Code in Voices
10 pages
How Speech Recognition Works: Hidden Markov Model
No ratings yet
How Speech Recognition Works: Hidden Markov Model
25 pages
Open AI Python
No ratings yet
Open AI Python
1 page
py report
No ratings yet
py report
8 pages
Project Testing
No ratings yet
Project Testing
11 pages
SpeechRecognition
No ratings yet
SpeechRecognition
5 pages
Import Datetime
No ratings yet
Import Datetime
6 pages
Virtual Assistance Project Brief
No ratings yet
Virtual Assistance Project Brief
8 pages
PYTHON: Practical Python Programming For Beginners & Experts With Hands-on Project
From Everand
PYTHON: Practical Python Programming For Beginners & Experts With Hands-on Project
Mark Chan
5/5 (4)
Python for Beginners: An Introduction to Learn Python Programming with Tutorials and Hands-On Examples
From Everand
Python for Beginners: An Introduction to Learn Python Programming with Tutorials and Hands-On Examples
Nathan Metzler
4/5 (2)
PythonCodes
No ratings yet
PythonCodes
2 pages
TSA Lab 2
No ratings yet
TSA Lab 2
3 pages
Chat Bot 1
No ratings yet
Chat Bot 1
7 pages
Lab Task # 5 Speech To Text AI
No ratings yet
Lab Task # 5 Speech To Text AI
2 pages
Project Report
No ratings yet
Project Report
58 pages
GroqCloud Speech Doc
No ratings yet
GroqCloud Speech Doc
5 pages
Voice Assistant
No ratings yet
Voice Assistant
3 pages
Group No. 5: AI Desktop Assistant
No ratings yet
Group No. 5: AI Desktop Assistant
10 pages
Speech Recognition System
No ratings yet
Speech Recognition System
16 pages
Presentation On - Ohh Toodle, An Assistant: Presented by Presented To
No ratings yet
Presentation On - Ohh Toodle, An Assistant: Presented by Presented To
10 pages
Speech Understanding Content
No ratings yet
Speech Understanding Content
10 pages
jarvis
No ratings yet
jarvis
4 pages
Labs_9
No ratings yet
Labs_9
4 pages
Voice Assistant Using Python 2
No ratings yet
Voice Assistant Using Python 2
20 pages
Speech Recognition
No ratings yet
Speech Recognition
13 pages
Data Sorting Guideline
No ratings yet
Data Sorting Guideline
2 pages
Voice Assistant Suggetion
No ratings yet
Voice Assistant Suggetion
3 pages
Building A ChatGPT-4 Voice Assistant With Vivid U
No ratings yet
Building A ChatGPT-4 Voice Assistant With Vivid U
18 pages
Speech Recognition Transcription With Open Source ...
No ratings yet
Speech Recognition Transcription With Open Source ...
2 pages
V Assist
No ratings yet
V Assist
3 pages
2. Sphinx speech recognition
No ratings yet
2. Sphinx speech recognition
5 pages
Speech To Text Conversion
No ratings yet
Speech To Text Conversion
7 pages
This Should Be Finnal
No ratings yet
This Should Be Finnal
40 pages
Speech Recog
No ratings yet
Speech Recog
5 pages
Python Pre-Installations
No ratings yet
Python Pre-Installations
2 pages
voice_assistant_code
No ratings yet
voice_assistant_code
4 pages
AI Desktop
No ratings yet
AI Desktop
14 pages
PPT_Format_edit[1] (2)
No ratings yet
PPT_Format_edit[1] (2)
10 pages
REAL TIME TRANSCRIPTION SERVICE FOR ONLINE MEETINGS USING WHISPER API
No ratings yet
REAL TIME TRANSCRIPTION SERVICE FOR ONLINE MEETINGS USING WHISPER API
16 pages
Fai Lab Project By-:Group 6
No ratings yet
Fai Lab Project By-:Group 6
7 pages
Lecture
No ratings yet
Lecture
7 pages
Ai Voice Assistant
No ratings yet
Ai Voice Assistant
14 pages
A Simple Guide To OpenAI API With Python
No ratings yet
A Simple Guide To OpenAI API With Python
9 pages
I Guess This Will Be Finnal
No ratings yet
I Guess This Will Be Finnal
41 pages
Paperpublish
No ratings yet
Paperpublish
2 pages
Building of Personalised Ai Assistant Phase 2
No ratings yet
Building of Personalised Ai Assistant Phase 2
10 pages
Voice M
No ratings yet
Voice M
19 pages
Jarvis Voice Assistant
No ratings yet
Jarvis Voice Assistant
2 pages
synopsis
No ratings yet
synopsis
6 pages
dhara_NLP_Practical
No ratings yet
dhara_NLP_Practical
67 pages
KNIME Python Integration Guide: KNIME AG, Zurich, Switzerland Version 4.3 (Last Updated On 2020-12-06)
No ratings yet
KNIME Python Integration Guide: KNIME AG, Zurich, Switzerland Version 4.3 (Last Updated On 2020-12-06)
20 pages
(Ebook) Advanced Applied Deep Learning: Convolutional Neural Networks and Object Detection by Umberto Michelucci ISBN 9781484249758, 1484249755 - Get the ebook in PDF format for a complete experience
100% (3)
(Ebook) Advanced Applied Deep Learning: Convolutional Neural Networks and Object Detection by Umberto Michelucci ISBN 9781484249758, 1484249755 - Get the ebook in PDF format for a complete experience
84 pages
Industrial Training Report: Course "Artificial Intelligence"
No ratings yet
Industrial Training Report: Course "Artificial Intelligence"
30 pages
Instant ebooks textbook Machine Learning Pocket Reference Working with Structured Data in Python 1st Edition Matt Harrison download all chapters
100% (4)
Instant ebooks textbook Machine Learning Pocket Reference Working with Structured Data in Python 1st Edition Matt Harrison download all chapters
65 pages
Problem Solving With Python Compress
No ratings yet
Problem Solving With Python Compress
326 pages
A Python Library For Teaching Computation To Seismology Students
No ratings yet
A Python Library For Teaching Computation To Seismology Students
7 pages
2. Python Programming Development Environment Set-up
No ratings yet
2. Python Programming Development Environment Set-up
19 pages
Practical Data Science with Jupyter: Explore Data Cleaning, Pre-processing, Data Wrangling, Feature Engineering and Machine Learning using Python and Jupyter (English Edition) Prateek Gupta - Download the complete ebook in PDF format and read freely
100% (4)
Practical Data Science with Jupyter: Explore Data Cleaning, Pre-processing, Data Wrangling, Feature Engineering and Machine Learning using Python and Jupyter (English Edition) Prateek Gupta - Download the complete ebook in PDF format and read freely
72 pages
Blockchain Analytics
No ratings yet
Blockchain Analytics
9 pages
?python For Data Analysis Cheatsheet
100% (3)
?python For Data Analysis Cheatsheet
128 pages
Geemap Readthedocs Io en Latest
No ratings yet
Geemap Readthedocs Io en Latest
94 pages
007 Python Introduction
No ratings yet
007 Python Introduction
26 pages
Introduction To Python (Lab)
No ratings yet
Introduction To Python (Lab)
28 pages
Raspberry Pi and Python
No ratings yet
Raspberry Pi and Python
108 pages
Tting Started With Python
No ratings yet
Tting Started With Python
12 pages
Id-11659 Scrapping Web
No ratings yet
Id-11659 Scrapping Web
295 pages
Quaternion Open Risk Platform Userguide
No ratings yet
Quaternion Open Risk Platform Userguide
194 pages
2024 CS224N Python Review Session Slides.pptx
No ratings yet
2024 CS224N Python Review Session Slides.pptx
66 pages
BIG DATA ANALYTICS Lab Manual
No ratings yet
BIG DATA ANALYTICS Lab Manual
51 pages
PDF Learning IPython for Interactive Computing and Data Visualization - Second Edition Cyrille Rossant download
No ratings yet
PDF Learning IPython for Interactive Computing and Data Visualization - Second Edition Cyrille Rossant download
55 pages
06 - Food Calorie Estimation
50% (2)
06 - Food Calorie Estimation
39 pages
(Ebook) Python Business Intelligence Cookbook by Robert Dempsey ISBN 9781785287466, 178528746X download pdf
100% (2)
(Ebook) Python Business Intelligence Cookbook by Robert Dempsey ISBN 9781785287466, 178528746X download pdf
67 pages
Ultimate Data Science Programming in Python 9365895669
No ratings yet
Ultimate Data Science Programming in Python 9365895669
756 pages
Deep Learning With PyTorch: Object Classification - Filliat Et Al
No ratings yet
Deep Learning With PyTorch: Object Classification - Filliat Et Al
3 pages
STAT 451: Intro To Machine Learning Lecture Notes
100% (1)
STAT 451: Intro To Machine Learning Lecture Notes
17 pages
Data Mining and Predictive Modelling Assignment
No ratings yet
Data Mining and Predictive Modelling Assignment
34 pages
Python Pandas1
No ratings yet
Python Pandas1
39 pages
Python Lab Manual - EC - Dept
No ratings yet
Python Lab Manual - EC - Dept
23 pages
Installing_Anaconda_Tensorflow
No ratings yet
Installing_Anaconda_Tensorflow
4 pages
1.1-1.4_Introduction to Python
No ratings yet
1.1-1.4_Introduction to Python
50 pages

Setting Up Packages for Speech Recognition

Uploaded by

Setting Up Packages for Speech Recognition

Uploaded by

Setting Up Packages for

Step 1: Ensure Your Environment is Active

• If you’re not in the correct environment, activate it with:

conda activate speech_env

Step 2: Install Audio Processing and Visualization

pip install librosa

2. Install SpeechRecognition (for converting spoken language to text):

o Enter the following command:

pip install SpeechRecognition

o Note: Be careful with the capitalization and spacing—SpeechRecognition must

3. Install jiwer (for evaluating speech recognition accuracy):

o Type the following command:

pip install jiwer

4. Install matplotlib (for visualizations):

o Use this command to install:

pip install matplotlib

5. Install Google Text-to-Speech (gTTS) (for converting text into speech):

o Run this command:

Step 3: Prepare for Whisper Installation

o Go to the PyTorch website and select the “Get Started” option.

o Under “Start locally”, configure the following:

▪ Your operating system (e.g., Windows, macOS)

▪ Compute platform: Select a CUDA version if you have a GPU or select

2. Install Chocolatey (Windows only):

o Go to the Chocolatey website and click Install.

o Choose Individual installation and copy the provided code.

o In Anaconda Prompt (or PowerShell with Chocolatey), type:

choco install ffmpeg

Step 4: Install Whisper AI for Speech Recognition

pip install -U openai-whisper

Step 5: Verify Whisper Installation

• To confirm Whisper is correctly installed, type:

pip show openai-whisper

o If installed, this command will display Whisper’s details.

You might also like