0% found this document useful (0 votes)

59 views

Spoken Language Processing in Python Chapter3

Uploaded by

Fgpeqw

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

59 views

Spoken Language Processing in Python Chapter3

Uploaded by

Fgpeqw

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 26

Introduction to

PyDub
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Installing PyDub
$ pip install pydub

If using les other than .wav , install ffmpeg via ffmpeg.org

SPOKEN LANGUAGE PROCESSING IN PYTHON

PyDub's main class, AudioSegment
# Import PyDub main class
from pydub import AudioSegment

# Import an audio file

wav_file = AudioSegment.from_file(file="wav_file.wav", format="wav")

# Format parameter only for readability

wav_file = AudioSegment.from_file(file="wav_file.wav")

type(wav_file)

pydub.audio_segment.AudioSegment

SPOKEN LANGUAGE PROCESSING IN PYTHON

Playing an audio le
# Install simpleaudio for wav playback
$pip install simpleaudio

# Import play function

from pydub.playback import play

# Import audio file

wav_file = AudioSegment.from_file(file="wav_file.wav")

# Play audio file

play(wav_file)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Audio parameters
# Import audio files
wav_file = AudioSegment.from_file(file="wav_file.wav")
two_speakers = AudioSegment.from_file(file="two_speakers.wav")

# Check number of channels

wav_file.channels, two_speakers.channels

1, 2

wav_file.frame_rate

480000

SPOKEN LANGUAGE PROCESSING IN PYTHON

Audio parameters
# Find the number of bytes per sample
wav_file.sample_width

# Find the max amplitude

wav_file.max

8488

SPOKEN LANGUAGE PROCESSING IN PYTHON

Audio parameters
# Duration of audio file in milliseconds
len(wav_file)

3284

SPOKEN LANGUAGE PROCESSING IN PYTHON

Changing audio parameters
# Change ATTRIBUTENAME of AudioSegment to x
changeed_audio_segment = audio_segment.set_ATTRIBUTENAME(x)

# Change sample width to 1

wav_file_width_1 = wav_file.sample_width(1)
wav_file_width_1.sample_width

SPOKEN LANGUAGE PROCESSING IN PYTHON

Changing audio parameters
# Change sample rate
wav_file_16k = wav_file.frame_rate(16000)
wav_file_16k.frame_rate

16000

# Change number of channels

wav_file_1_channel = wav_file.set_channels(1)
wav_file_1_channel.channels

SPOKEN LANGUAGE PROCESSING IN PYTHON

Let's practice!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON
Manipulating audio
les with PyDub
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Turning it down to 11
# Import audio file
wav_file = AudioSegment.from_file("wav_file.wav")
# Minus 60 dB
quiet_wav_file = wav_file - 60

# Try to recognize quiet audio

recognizer.recognize_google(quiet_wav_file)

UnknownValueError:

SPOKEN LANGUAGE PROCESSING IN PYTHON

Increasing the volume
# Increase the volume by 10 dB
louder_wav_file = wav_file + 10

# Try to recognize
recognizer.recognize_google(louder_wav_file)

this is a wav file

SPOKEN LANGUAGE PROCESSING IN PYTHON

This all sounds the same
# Import AudioSegment and normalize
from pydub import AudioSegment
from pydub.effects import normalize
from pydub.playback import play

# Import uneven sound audio file

loud_quiet = AudioSegment.from_file("loud_quiet.wav")
# Normalize the sound levels
normalized_loud_quiet = normalize(loud_quiet)

# Check the sound

play(normalized_loud_quiet)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Remixing your audio les
# Import audio with static at start
static_at_start = AudioSegment.from_file("static_at_start.wav")

# Remove the static via slicing

no_static_at_start = static_at_start[5000:]

# Check the new sound

play(no_static_at_start)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Remixing your audio les
# Import two audio files
wav_file_1 = AudioSegment.from_file("wav_file_1.wav")
wav_file_2 = AudioSegment.from_file("wav_file_2.wav")

# Combine the two audio files

wav_file_3 = wav_file_1 + wav_file_2

# Check the sound

play(wav_file_3)

# Combine two wav files and make the combination louder

louder_wav_file_3 = wav_file_1 + wav_file_2 + 10

SPOKEN LANGUAGE PROCESSING IN PYTHON

Splitting your audio
# Import phone call audio
phone_call = AudioSegment.from_file("phone_call.wav")
# Find number of channels
phone_call.channels

# Split stereo to mono

phone_call_channels = phone_call.split_to_mono()
phone_call_channels

[<pydub.audio_segment.AudioSegment, <pydub.audio_segment.AudioSegment>]

SPOKEN LANGUAGE PROCESSING IN PYTHON

Splitting your audio
# Find number of channels of first list item
phone_call_channels[0].channels

# Recognize the first channel

recognizer.recognize_google(phone_call_channel_1)

the pydub library is really useful

SPOKEN LANGUAGE PROCESSING IN PYTHON

Let's code!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON
Converting and
saving audio les
with PyDub
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Exporting audio les
from pydub import AudioSegment

# Import audio file

wav_file = AudioSegment.from_file("wav_file.wav")

# Increase by 10 decibels
louder_wav_file = wav_file + 10

# Export louder audio file

louder_wav_file.export(out_f="louder_wav_file.wav", format="wav")

<_io.BufferedRandom name='louder_wav_file.wav'>

SPOKEN LANGUAGE PROCESSING IN PYTHON

Reformatting and exporting multiple audio les
def make_wav(wrong_folder_path, right_folder_path):

# Loop through wrongly formatted files

for file in os.scandir(wrong_folder_path):

# Only work with files with audio extensions we're fixing

if file.path.endswith(".mp3") or file.path.endswith(".flac"):

# Create the new .wav filename

out_file = right_folder_path + os.path.splitext(os.path.basename(file.path))[0] + ".wav"

# Read in the audio file and export it in wav format

AudioSegment.from_file(file.path).export(out_file,
format="wav")

print(f"Creating {out_file}")

SPOKEN LANGUAGE PROCESSING IN PYTHON

Reformatting and exporting multiple audio les
# Call our new function
make_wav("data/wrong_formats/", "data/right_format/")

Creating data/right_types/wav_file.wav
Creating data/right_types/flac_file.wav
Creating data/right_types/mp3_file.wav

SPOKEN LANGUAGE PROCESSING IN PYTHON

Manipulating and exporting
def make_no_static_louder(static_quiet, louder_no_static):
# Loop through files with static and quiet (already in wav format)
for file in os.scandir(static_quiet_folder_path):

# Create new file path

out_file = louder_no_static + os.path.splitext(os.path.basename(file.path))[0] + ".wav"

# Read the audio file

audio_file = AudioSegment.from_file(file.path)

# Remove first three seconds and add 10 decibels and export

audio_file = (audio_file[3100:] + 10).export(out_file, format="wav")

print(f"Creating {out_file}")

SPOKEN LANGUAGE PROCESSING IN PYTHON

Manipulating and exporting
# Remove static and make louder
make_no_static_louder("data/static_quiet/", "data/louder_no_static/")

Creating data/louder_no_static/speech-recognition-services.wav
Creating data/louder_no_static/order-issue.wav
Creating data/louder_no_static/help-with-acount.wav

SPOKEN LANGUAGE PROCESSING IN PYTHON

Your turn!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Get Started With Databricks For Machine Learning
No ratings yet
Get Started With Databricks For Machine Learning
85 pages
Credit Risk Modeling in Python Chapter3
No ratings yet
Credit Risk Modeling in Python Chapter3
35 pages
iCEDQ Brochure - Product Datasheet
No ratings yet
iCEDQ Brochure - Product Datasheet
5 pages
Designing Machine Learning Workflows in Python Chapter2
No ratings yet
Designing Machine Learning Workflows in Python Chapter2
39 pages
Analyzing IoT Data in Python Chapter3
No ratings yet
Analyzing IoT Data in Python Chapter3
30 pages
Introduction To Data Visualization With Seaborn Chapter3
100% (1)
Introduction To Data Visualization With Seaborn Chapter3
32 pages
Data Quality Administration Guide
No ratings yet
Data Quality Administration Guide
210 pages
Inkscape Manual
No ratings yet
Inkscape Manual
142 pages
كتاب إدارة الموارد البشرية - جاري ديسلر
No ratings yet
كتاب إدارة الموارد البشرية - جاري ديسلر
615 pages
Spoken Language Processing in Python Chapter2
No ratings yet
Spoken Language Processing in Python Chapter2
23 pages
Spoken Language Processing in Python Chapter4
No ratings yet
Spoken Language Processing in Python Chapter4
46 pages
Spoken Language Processing in Python Chapter1
No ratings yet
Spoken Language Processing in Python Chapter1
17 pages
Designing Machine Learning Workflows in Python Chapter1
No ratings yet
Designing Machine Learning Workflows in Python Chapter1
32 pages
Designing Machine Learning Workflows in Python Chapter4
No ratings yet
Designing Machine Learning Workflows in Python Chapter4
38 pages
Analyzing IoT Data in Python Chapter4
No ratings yet
Analyzing IoT Data in Python Chapter4
34 pages
Designing Machine Learning Workflows in Python Chapter3
No ratings yet
Designing Machine Learning Workflows in Python Chapter3
42 pages
Introduction To Data Visualization With Seaborn Chapter2
No ratings yet
Introduction To Data Visualization With Seaborn Chapter2
38 pages
Introduction To Data Visualization With Matplotlib Chapter2
No ratings yet
Introduction To Data Visualization With Matplotlib Chapter2
27 pages
Building Chatbots in Python Chapter2 PDF
No ratings yet
Building Chatbots in Python Chapter2 PDF
41 pages
Introduction To Data Visualization With Python
No ratings yet
Introduction To Data Visualization With Python
47 pages
Introduction To Data Visualization With Seaborn Chapter1
No ratings yet
Introduction To Data Visualization With Seaborn Chapter1
26 pages
List Comprehension in Python
No ratings yet
List Comprehension in Python
8 pages
Building Chatbots in Python Chapter4
No ratings yet
Building Chatbots in Python Chapter4
20 pages
Cloud Practitioner: Aws Certified
No ratings yet
Cloud Practitioner: Aws Certified
18 pages
Business Requirements Document /: Project Name Module Name
No ratings yet
Business Requirements Document /: Project Name Module Name
11 pages
Early Stopping in Practice
No ratings yet
Early Stopping in Practice
14 pages
Technologies For Handling Big Data: Prepared By: Saidatul Rahah Hamidi
No ratings yet
Technologies For Handling Big Data: Prepared By: Saidatul Rahah Hamidi
49 pages
Extraction, Transformation, and Load (ETL) Specification
No ratings yet
Extraction, Transformation, and Load (ETL) Specification
8 pages
1 - Optimize Amazon SageMaker Deployment Strategies
No ratings yet
1 - Optimize Amazon SageMaker Deployment Strategies
45 pages
Data Visualisation Using Pyplot
No ratings yet
Data Visualisation Using Pyplot
20 pages
Cleaning Data With PySpark Chapter3
No ratings yet
Cleaning Data With PySpark Chapter3
25 pages
A Practical Approach To Linear Regression in Machine Learning - by Ashwin Raj - Towards Data Science
No ratings yet
A Practical Approach To Linear Regression in Machine Learning - by Ashwin Raj - Towards Data Science
20 pages
Machine Learning With Python PDF
No ratings yet
Machine Learning With Python PDF
5 pages
Data Scientist Certification Study Guide
No ratings yet
Data Scientist Certification Study Guide
7 pages
Credit Score Validation
No ratings yet
Credit Score Validation
5 pages
T-GCPBDML-B - M2 - Data Engineering For Streaming Data - ILT Slides
No ratings yet
T-GCPBDML-B - M2 - Data Engineering For Streaming Data - ILT Slides
71 pages
SQL Server To Aurora PostgreSQL Migration Playbook 1.0 Preliminary
No ratings yet
SQL Server To Aurora PostgreSQL Migration Playbook 1.0 Preliminary
456 pages
Experiment No: 1 Introduction To Data Analytics and Python Fundamentals Page-1/11
No ratings yet
Experiment No: 1 Introduction To Data Analytics and Python Fundamentals Page-1/11
8 pages
ETL Testing Concepts iCEDQ
No ratings yet
ETL Testing Concepts iCEDQ
20 pages
Python Data Structures
No ratings yet
Python Data Structures
8 pages
Slide 13 - Kafka
No ratings yet
Slide 13 - Kafka
109 pages
Cert DEWD (Edits)
No ratings yet
Cert DEWD (Edits)
158 pages
Fast Payment Flagship - Final - Nov 1
No ratings yet
Fast Payment Flagship - Final - Nov 1
113 pages
Power BI Cheat Sheet
No ratings yet
Power BI Cheat Sheet
10 pages
I&A Tech Solution Architecture Guidelines
No ratings yet
I&A Tech Solution Architecture Guidelines
321 pages
Analyzing IoT Data in Python Chapter2
No ratings yet
Analyzing IoT Data in Python Chapter2
35 pages
ML Cheatsheets
100% (2)
ML Cheatsheets
17 pages
PySpark CheatSheet Edureka
No ratings yet
PySpark CheatSheet Edureka
1 page
Advancing Machine Learning and AI With Geography and GIS: Robert Kircher
No ratings yet
Advancing Machine Learning and AI With Geography and GIS: Robert Kircher
31 pages
Lesson 07 Data Manipulation With Pandas
No ratings yet
Lesson 07 Data Manipulation With Pandas
82 pages
2024 DQOps Ebook A Step-By-step Guide To Improve Data Quality
No ratings yet
2024 DQOps Ebook A Step-By-step Guide To Improve Data Quality
120 pages
Keras Cheat Sheet Python
No ratings yet
Keras Cheat Sheet Python
1 page
SAS Presentation
No ratings yet
SAS Presentation
49 pages
Python PPT 01
No ratings yet
Python PPT 01
286 pages
Python For Non-Programmers Final
No ratings yet
Python For Non-Programmers Final
218 pages
Logistic Regression
No ratings yet
Logistic Regression
24 pages
Pydub
No ratings yet
Pydub
26 pages
SpeechRecognition
No ratings yet
SpeechRecognition
5 pages
Voice_Assistant_Report
No ratings yet
Voice_Assistant_Report
4 pages
Week-8 Nlp Lab Program
No ratings yet
Week-8 Nlp Lab Program
6 pages
Lecture
No ratings yet
Lecture
7 pages
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
No ratings yet
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
48 pages
How to Create and Manage Mp3 Songs
From Everand
How to Create and Manage Mp3 Songs
Jeff Palmer
No ratings yet
Preparing Your Gures To Share With Others: Ariel Rokem
No ratings yet
Preparing Your Gures To Share With Others: Ariel Rokem
35 pages
Chapter3 PDF
No ratings yet
Chapter3 PDF
36 pages
Introduction To Data Visualization With Matplotlib: Ariel Rokem
No ratings yet
Introduction To Data Visualization With Matplotlib: Ariel Rokem
30 pages
Changing Plot Style and Color: Erin Case
No ratings yet
Changing Plot Style and Color: Erin Case
54 pages
Credit Risk Modeling in Python Chapter4
100% (1)
Credit Risk Modeling in Python Chapter4
35 pages
Customer Segmentation in Python Chapter3
No ratings yet
Customer Segmentation in Python Chapter3
25 pages
Customer Segmentation in Python Chapter4
No ratings yet
Customer Segmentation in Python Chapter4
37 pages
Cleaning Data With PySpark Chapter4
No ratings yet
Cleaning Data With PySpark Chapter4
23 pages
Cleaning Data With PySpark Chapter2
100% (1)
Cleaning Data With PySpark Chapter2
25 pages
Cleaning Data With PySpark Chapter1
0% (1)
Cleaning Data With PySpark Chapter1
20 pages
Advanced NLP With Spacy Chapter4
No ratings yet
Advanced NLP With Spacy Chapter4
26 pages
Analyzing IoT Data in Python Chapter1
100% (1)
Analyzing IoT Data in Python Chapter1
27 pages
Functionalitati Allplan Engineering
No ratings yet
Functionalitati Allplan Engineering
1 page
Punjene Paprike - Recept I Sastojci - Bosanskikuhar - Ba
No ratings yet
Punjene Paprike - Recept I Sastojci - Bosanskikuhar - Ba
9 pages
Introduction To CSS: Sensitivity: Internal & Restricted
No ratings yet
Introduction To CSS: Sensitivity: Internal & Restricted
19 pages
Ex No: 1 Web Page Design Using HTML Date:: SRC Alt Usemap Name Shape Coords Alt Href Shape Coords Alt Href
No ratings yet
Ex No: 1 Web Page Design Using HTML Date:: SRC Alt Usemap Name Shape Coords Alt Href Shape Coords Alt Href
5 pages
Bitmap: Using Using Using Using Using Using Using Using Using Namespace Public Partial Class Static Public
No ratings yet
Bitmap: Using Using Using Using Using Using Using Using Using Namespace Public Partial Class Static Public
22 pages
Log
No ratings yet
Log
26 pages
Get Started With The UIC Beamer Theme: Using L TEX To Prepare Slides
No ratings yet
Get Started With The UIC Beamer Theme: Using L TEX To Prepare Slides
32 pages
Css XML DTD
No ratings yet
Css XML DTD
23 pages
Prepladder Notes 2
No ratings yet
Prepladder Notes 2
239 pages
A 1 Note
No ratings yet
A 1 Note
5 pages
HTML LAB programs
No ratings yet
HTML LAB programs
5 pages
Multiple Videos
No ratings yet
Multiple Videos
6 pages
Coding - Intro To CSS
No ratings yet
Coding - Intro To CSS
29 pages
Interview Questions JSON
No ratings yet
Interview Questions JSON
14 pages
Advance Filters
No ratings yet
Advance Filters
42 pages
Cercul Trigonometric
No ratings yet
Cercul Trigonometric
1,380 pages
Test
No ratings yet
Test
4 pages
Irremovable List
No ratings yet
Irremovable List
68 pages
File List
No ratings yet
File List
23 pages
PrimeFaces Showcase Video
No ratings yet
PrimeFaces Showcase Video
1 page
Unit-2 HTML abhinay Kumar
No ratings yet
Unit-2 HTML abhinay Kumar
7 pages
Exif Iptc Editor Adobe Acrobat PDF
No ratings yet
Exif Iptc Editor Adobe Acrobat PDF
2 pages
HTML Difference FAQs-1
No ratings yet
HTML Difference FAQs-1
2 pages
Help Desk Final Quiz
No ratings yet
Help Desk Final Quiz
2 pages
HTML
No ratings yet
HTML
104 pages
Chapter 4
No ratings yet
Chapter 4
7 pages
HTML5 Notes
No ratings yet
HTML5 Notes
39 pages
Espania IPTV by MrRisk - SVB
50% (2)
Espania IPTV by MrRisk - SVB
3 pages

Spoken Language Processing in Python Chapter3

Uploaded by

Spoken Language Processing in Python Chapter3

Uploaded by

Introduction to

If using les other than .wav , install ffmpeg via ffmpeg.org

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Import an audio file

# Format parameter only for readability

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Import play function

# Import audio file

# Play audio file

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Check number of channels

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Find the max amplitude

SPOKEN LANGUAGE PROCESSING IN PYTHON

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Change sample width to 1

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Change number of channels

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Try to recognize quiet audio

SPOKEN LANGUAGE PROCESSING IN PYTHON

this is a wav file

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Import uneven sound audio file

# Check the sound

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Remove the static via slicing

# Check the new sound

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Combine the two audio files

# Check the sound

# Combine two wav files and make the combination louder

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Split stereo to mono

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Recognize the first channel

the pydub library is really useful

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Import audio file

# Export louder audio file

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Loop through wrongly formatted files

# Only work with files with audio extensions we're fixing

# Create the new .wav filename

# Read in the audio file and export it in wav format

SPOKEN LANGUAGE PROCESSING IN PYTHON

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Create new file path

# Read the audio file

# Remove first three seconds and add 10 decibels and export

SPOKEN LANGUAGE PROCESSING IN PYTHON

SPOKEN LANGUAGE PROCESSING IN PYTHON

You might also like