0% found this document useful (0 votes)

72 views17 pages

Spoken Language Processing in Python Chapter1

This document introduces audio data processing in Python. It discusses different audio file formats like mp3 and wav, and how audio is measured in frequency (kHz). It then demonstrates how to open an audio file in Python, convert the soundwave bytes to integers, find the frame rate and timestamps. Finally, it shows how to visualize two sound waves on a single plot to compare them.

Uploaded by

Fgpeqw

Available Formats

Download as PDF, TXT or read online on Scribd

Download as pdf or txt

0% found this document useful (0 votes)

72 views17 pages

Spoken Language Processing in Python Chapter1

Uploaded by

Fgpeqw

Available Formats

Download as PDF, TXT or read online on Scribd

Download as pdf or txt

You are on page 1/ 17

Introduction to

audio data in Python

S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Dealing with audio les in Python
Different kinds all of audio les
mp3

wav

m4a

Digital sounds measured in frequency (kHz)

1 kHz = 1000 pieces of information per second

SPOKEN LANGUAGE PROCESSING IN PYTHON

Frequency examples
Streaming songs have a frequency of 32 kHz

Audiobooks and spoken language are between 8 and 16 kHz

We can't see audio les so we have to transform them rst

import wave

SPOKEN LANGUAGE PROCESSING IN PYTHON

Opening an audio le in Python
Audio le saved as good-morning.wav

# Import audio file as wave object

good_morning = wave.open("good-morning.wav", "r")

# Convert wave object to bytes

good_morning_soundwave = good_morning.readframes(-1)

# View the wav file in byte form

good_morning_soundwave

b'\xfd\xff\xfb\xff\xf8\xff\xf8\xff\xf7\...

SPOKEN LANGUAGE PROCESSING IN PYTHON

Working with audio is different
Have to convert the audio to something useful

Small sample of audio = large amount of information

SPOKEN LANGUAGE PROCESSING IN PYTHON

Let's practice!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON
Converting sound
wave bytes to
integers
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Converting bytes to integers
Can't use bytes

Convert bytes to integers using numpy

import numpy as np

# Convert soundwave_gm from bytes to integers

signal_gm = np.frombuffer(soundwave_gm, dtype='int16')

# Show the first 10 items

signal_gm[:10]

array([ -3, -5, -8, -8, -9, -13, -8, -10, -9, -11], dtype=int16)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Finding the frame rate
Frequency (Hz) = length of wave object array/duration of audio le (seconds)

# Get the frame rate

framerate_gm = good_morning.getframerate()

# Show the frame rate

framerate_gm

48,000

Duration of audio le (seconds) = length of wave object array/frequency (Hz)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Finding sound wave timestamps
# Return evenly spaced values between start and stop
np.linspace(start=1, stop=10, num=10)

array([ 1., 2., 3., 4., 5., 6., 7., 8., 9., 10.])

# Get the timestamps of the good morning sound wave

time_gm = np.linspace(start=0,
stop=len(soundwave_gm)/framerate_gm,
num=len(soundwave_gm))

SPOKEN LANGUAGE PROCESSING IN PYTHON

Finding sound wave timestamps
# View first 10 time stamps of good morning sound wave
time_gm[:10]

array([0.00000000e+00, 2.08334167e-05, 4.16668333e-05, 6.25002500e-05,

8.33336667e-05, 1.04167083e-04, 1.25000500e-04, 1.45833917e-04,
1.66667333e-04, 1.87500750e-04])

SPOKEN LANGUAGE PROCESSING IN PYTHON

Let's practice!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON
Visualizing sound
waves
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Adding another sound wave
New audio le: good_afternoon.wav

Both are 48 kHz

Same data transformations to all audio les

SPOKEN LANGUAGE PROCESSING IN PYTHON

Setting up a plot
import matplotlib.pyplot as plt

# Initialize figure and setup title

plt.title("Good Afternoon vs. Good Morning")

# x and y axis labels

plt.xlabel("Time (seconds)")
plt.ylabel("Amplitude")

# Add good morning and good afternoon values

plt.plot(time_ga, soundwave_ga, label ="Good Afternoon")
plt.plot(time_gm, soundwave_gm, label="Good Morning",
alpha=0.5)

# Create a legend and show our plot

plt.legend()
plt.show()

SPOKEN LANGUAGE PROCESSING IN PYTHON

SPOKEN LANGUAGE PROCESSING IN PYTHON
Time to visualize!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Natural language processing with TensorFlow Teach language to machines using Python s deep learning library 1st Edition Thushan Ganegedara 2024 scribd download
50% (2)
Natural language processing with TensorFlow Teach language to machines using Python s deep learning library 1st Edition Thushan Ganegedara 2024 scribd download
62 pages
Scan To BIM - Presentation
No ratings yet
Scan To BIM - Presentation
61 pages
M1 - Introducing Google Cloud v5.2 - ILT
No ratings yet
M1 - Introducing Google Cloud v5.2 - ILT
69 pages
Sony Str-dh590 Ver.1.0 SM
No ratings yet
Sony Str-dh590 Ver.1.0 SM
70 pages
Brochure - SAS - SIPC Eng Arteche
No ratings yet
Brochure - SAS - SIPC Eng Arteche
24 pages
A Guide To 21 Feature Importance Methods and Packages in Machine Learning (With Code) - by Theophano Mitsa - Dec, 2023 - Towards Data Science
100% (1)
A Guide To 21 Feature Importance Methods and Packages in Machine Learning (With Code) - by Theophano Mitsa - Dec, 2023 - Towards Data Science
41 pages
Credit Risk Modeling in Python Chapter3
No ratings yet
Credit Risk Modeling in Python Chapter3
35 pages
3CX Basic Training - Trainer Checklist
No ratings yet
3CX Basic Training - Trainer Checklist
23 pages
Designing Machine Learning Workflows in Python Chapter2
No ratings yet
Designing Machine Learning Workflows in Python Chapter2
39 pages
Bro Log Vars
No ratings yet
Bro Log Vars
6 pages
A Matlab Script To Explore Linear Predictive Coding With Vocal
No ratings yet
A Matlab Script To Explore Linear Predictive Coding With Vocal
6 pages
Analyzing IoT Data in Python Chapter3
No ratings yet
Analyzing IoT Data in Python Chapter3
30 pages
Spoken Language Processing in Python Chapter3
No ratings yet
Spoken Language Processing in Python Chapter3
26 pages
Spoken Language Processing in Python Chapter2
No ratings yet
Spoken Language Processing in Python Chapter2
23 pages
Spoken Language Processing in Python Chapter4
No ratings yet
Spoken Language Processing in Python Chapter4
46 pages
Designing Machine Learning Workflows in Python Chapter4
No ratings yet
Designing Machine Learning Workflows in Python Chapter4
38 pages
Introduction To Data Visualization With Python
No ratings yet
Introduction To Data Visualization With Python
47 pages
Transformers in NLP 1
No ratings yet
Transformers in NLP 1
9 pages
Data Analysis With Python - Day2
No ratings yet
Data Analysis With Python - Day2
159 pages
Generative Adversarial Networks (Gans) - 01: Main Notions About Gans
No ratings yet
Generative Adversarial Networks (Gans) - 01: Main Notions About Gans
2 pages
Logistic Regression
No ratings yet
Logistic Regression
24 pages
Designing Machine Learning Workflows in Python Chapter3
No ratings yet
Designing Machine Learning Workflows in Python Chapter3
42 pages
Stanford University CS224d - Deep Learning For Natural Language Processing - Syllabus
No ratings yet
Stanford University CS224d - Deep Learning For Natural Language Processing - Syllabus
3 pages
Vector Representation of Text: Vagelis Hristidis Prepared With The Help of Nhat Le Many Slides Are From Richard Socher
No ratings yet
Vector Representation of Text: Vagelis Hristidis Prepared With The Help of Nhat Le Many Slides Are From Richard Socher
20 pages
Super VIP Cheatsheet - Deep Learning
No ratings yet
Super VIP Cheatsheet - Deep Learning
47 pages
roadmap-to-crack-data-science-ml-interviews
No ratings yet
roadmap-to-crack-data-science-ml-interviews
22 pages
PDF Deep Learning with JavaScript: Neural networks in TensorFlow.js 1st Edition Shanqing Cai download
100% (2)
PDF Deep Learning with JavaScript: Neural networks in TensorFlow.js 1st Edition Shanqing Cai download
65 pages
Machine Learning Lab Assignment CSE-716: S. M. Shafkat Raihan ID: 16701041 SESSION: 2015-16
No ratings yet
Machine Learning Lab Assignment CSE-716: S. M. Shafkat Raihan ID: 16701041 SESSION: 2015-16
9 pages
Semi-Automated Exploratory Data Analysis (EDA) in Python - by Destin Gong - Mar, 2021 - Towards Data
No ratings yet
Semi-Automated Exploratory Data Analysis (EDA) in Python - by Destin Gong - Mar, 2021 - Towards Data
3 pages
Starbucks Sentiment Analysis Using VADER
No ratings yet
Starbucks Sentiment Analysis Using VADER
23 pages
Amazon SageMaker First Call Deck
No ratings yet
Amazon SageMaker First Call Deck
191 pages
Data Structure and Algorithms
No ratings yet
Data Structure and Algorithms
24 pages
Advancing Machine Learning and AI With Geography and GIS: Robert Kircher
No ratings yet
Advancing Machine Learning and AI With Geography and GIS: Robert Kircher
31 pages
Writing Code For NLP Research PDF
No ratings yet
Writing Code For NLP Research PDF
254 pages
Full Stack Data Science
No ratings yet
Full Stack Data Science
54 pages
Object Oriented Programing in Python
No ratings yet
Object Oriented Programing in Python
220 pages
Applied Coding Track
No ratings yet
Applied Coding Track
10 pages
22 Selected Top Papers On Deep Learning
No ratings yet
22 Selected Top Papers On Deep Learning
393 pages
Course Plan Natural Language Processing
No ratings yet
Course Plan Natural Language Processing
5 pages
Early Stopping in Practice
No ratings yet
Early Stopping in Practice
14 pages
LSTM Spark
No ratings yet
LSTM Spark
14 pages
PDF Introduction To Machine Learning Wit PDF
0% (1)
PDF Introduction To Machine Learning Wit PDF
3 pages
Data Preprocessing For Python
No ratings yet
Data Preprocessing For Python
3 pages
Notes On ARIMA: ND RD
No ratings yet
Notes On ARIMA: ND RD
4 pages
Instant Ebooks Textbook Learning PHP, MySQL & JavaScript, 6th Edition Robin Nixon Download All Chapters
100% (5)
Instant Ebooks Textbook Learning PHP, MySQL & JavaScript, 6th Edition Robin Nixon Download All Chapters
62 pages
Topological Methods in Data Analysis and Visualization IV Theory Algorithms and Applications 1st Edition Hamish Carr 2024 Scribd Download
100% (3)
Topological Methods in Data Analysis and Visualization IV Theory Algorithms and Applications 1st Edition Hamish Carr 2024 Scribd Download
62 pages
Data Mining - IMT Nagpur-Manish
No ratings yet
Data Mining - IMT Nagpur-Manish
82 pages
SQL
No ratings yet
SQL
98 pages
Eda PDF
100% (1)
Eda PDF
45 pages
Introduction To Data Visualization With Seaborn Chapter2
No ratings yet
Introduction To Data Visualization With Seaborn Chapter2
38 pages
Introduction to Docker
No ratings yet
Introduction to Docker
136 pages
Data Preprocessing Python 1
No ratings yet
Data Preprocessing Python 1
3 pages
What Are Some of The Best Websites To Learn Competitive Coding - Quora
No ratings yet
What Are Some of The Best Websites To Learn Competitive Coding - Quora
4 pages
PHP Developer Interview Questions
No ratings yet
PHP Developer Interview Questions
9 pages
Machine Learning Mini-Project Report
No ratings yet
Machine Learning Mini-Project Report
26 pages
Azure Developer Learning Pathway 1122i
No ratings yet
Azure Developer Learning Pathway 1122i
2 pages
Machine Learning and Data Science With Python
No ratings yet
Machine Learning and Data Science With Python
7 pages
Figure Style and Scale: Darkgrid Whitegrid Dark White Ticks Darkgrid
No ratings yet
Figure Style and Scale: Darkgrid Whitegrid Dark White Ticks Darkgrid
15 pages
Natural Language Generation
No ratings yet
Natural Language Generation
5 pages
LLM With Knowledge Graphs
No ratings yet
LLM With Knowledge Graphs
40 pages
Data Mining Classification Algorithms: Credits: Padhraic Smyth
No ratings yet
Data Mining Classification Algorithms: Credits: Padhraic Smyth
54 pages
Natural Language Toolkit NLTK PDF
No ratings yet
Natural Language Toolkit NLTK PDF
23 pages
Multi Agents Share
No ratings yet
Multi Agents Share
45 pages
DSPA - ET22BTEC046 - LAB3.ipynb - Colab
No ratings yet
DSPA - ET22BTEC046 - LAB3.ipynb - Colab
7 pages
Pydub
No ratings yet
Pydub
26 pages
Chapter3 PDF
No ratings yet
Chapter3 PDF
36 pages
Introduction To Data Visualization With Matplotlib: Ariel Rokem
No ratings yet
Introduction To Data Visualization With Matplotlib: Ariel Rokem
30 pages
Preparing Your Gures To Share With Others: Ariel Rokem
No ratings yet
Preparing Your Gures To Share With Others: Ariel Rokem
35 pages
Introduction To Data Visualization With Matplotlib Chapter2
No ratings yet
Introduction To Data Visualization With Matplotlib Chapter2
27 pages
Changing Plot Style and Color: Erin Case
No ratings yet
Changing Plot Style and Color: Erin Case
54 pages
Introduction To Data Visualization With Seaborn Chapter1
No ratings yet
Introduction To Data Visualization With Seaborn Chapter1
26 pages
Introduction To Data Visualization With Seaborn Chapter3
100% (1)
Introduction To Data Visualization With Seaborn Chapter3
32 pages
Customer Segmentation in Python Chapter3
No ratings yet
Customer Segmentation in Python Chapter3
25 pages
Customer Segmentation in Python Chapter4
No ratings yet
Customer Segmentation in Python Chapter4
37 pages
Designing Machine Learning Workflows in Python Chapter1
No ratings yet
Designing Machine Learning Workflows in Python Chapter1
32 pages
Building Chatbots in Python Chapter4
No ratings yet
Building Chatbots in Python Chapter4
20 pages
Cleaning Data With PySpark Chapter4
No ratings yet
Cleaning Data With PySpark Chapter4
23 pages
Credit Risk Modeling in Python Chapter4
100% (1)
Credit Risk Modeling in Python Chapter4
35 pages
Cleaning Data With PySpark Chapter3
No ratings yet
Cleaning Data With PySpark Chapter3
25 pages
Analyzing IoT Data in Python Chapter4
No ratings yet
Analyzing IoT Data in Python Chapter4
34 pages
Cleaning Data With PySpark Chapter2
100% (1)
Cleaning Data With PySpark Chapter2
25 pages
Cleaning Data With PySpark Chapter1
0% (1)
Cleaning Data With PySpark Chapter1
20 pages
Analyzing IoT Data in Python Chapter2
No ratings yet
Analyzing IoT Data in Python Chapter2
35 pages
Building Chatbots in Python Chapter2 PDF
No ratings yet
Building Chatbots in Python Chapter2 PDF
41 pages
Advanced NLP With Spacy Chapter4
No ratings yet
Advanced NLP With Spacy Chapter4
26 pages
Analyzing IoT Data in Python Chapter1
100% (1)
Analyzing IoT Data in Python Chapter1
27 pages
P C M AC: Telephone User's Guide
No ratings yet
P C M AC: Telephone User's Guide
2 pages
ITS Networking Reviewer
No ratings yet
ITS Networking Reviewer
6 pages
Basic Embedded System Design Tutorial PDF
100% (1)
Basic Embedded System Design Tutorial PDF
204 pages
Dock Management Controller (DMC) : General Description
No ratings yet
Dock Management Controller (DMC) : General Description
26 pages
Control Interconnections G25LTA: TB-BC
No ratings yet
Control Interconnections G25LTA: TB-BC
1 page
Virtual Networks PDF
No ratings yet
Virtual Networks PDF
45 pages
Mini Hi-Fi Component System: Operating Instructions MHC-GT555 / GT444 MHC-GT222 / GT111 Lbt-Zt4
No ratings yet
Mini Hi-Fi Component System: Operating Instructions MHC-GT555 / GT444 MHC-GT222 / GT111 Lbt-Zt4
48 pages
Chapter 6 - Networks
No ratings yet
Chapter 6 - Networks
37 pages
CE6 Accessories - Ing
No ratings yet
CE6 Accessories - Ing
24 pages
RNAV Feedback: Generated Wednesday 16th of January 2019 10:19:29 AM
No ratings yet
RNAV Feedback: Generated Wednesday 16th of January 2019 10:19:29 AM
18 pages
Aevision NVR Ae20150701110
No ratings yet
Aevision NVR Ae20150701110
9 pages
CCTV PDF
No ratings yet
CCTV PDF
29 pages
207 00 Analog and Digital Motor Control Teaching Set
No ratings yet
207 00 Analog and Digital Motor Control Teaching Set
2 pages
Sharp TV Manual - 27F543
No ratings yet
Sharp TV Manual - 27F543
59 pages
RTOM - RTDS - DOC - Procedure Data Transmission RTOM SERVA View
No ratings yet
RTOM - RTDS - DOC - Procedure Data Transmission RTOM SERVA View
14 pages
5GNR With IBflex - v1
No ratings yet
5GNR With IBflex - v1
4 pages
Siera PRO 5001 Manual
No ratings yet
Siera PRO 5001 Manual
27 pages
Wired VS Wireless Headphones The Debate On Wired Vs
No ratings yet
Wired VS Wireless Headphones The Debate On Wired Vs
2 pages
H0 ECOM100 Spec Eng
No ratings yet
H0 ECOM100 Spec Eng
2 pages
PSC Magellan Sl381
No ratings yet
PSC Magellan Sl381
2 pages
Class-7 Dated 26-08-2023 Cloud Computing
No ratings yet
Class-7 Dated 26-08-2023 Cloud Computing
15 pages
Mobile Networks: Hamid Reza Bolhasani
No ratings yet
Mobile Networks: Hamid Reza Bolhasani
58 pages
Data Link Layer: Computer Networking: A Top Down Approach Featuring The Internet
No ratings yet
Data Link Layer: Computer Networking: A Top Down Approach Featuring The Internet
170 pages
Theoretical Analysis and Design Optimization For Multi-GNSS Direct Radio Frequency Sampling Receiver
No ratings yet
Theoretical Analysis and Design Optimization For Multi-GNSS Direct Radio Frequency Sampling Receiver
13 pages
Triple-Band Combiner: 790 - 960 MHZ 1710 - 2180 MHZ 2490 - 2690 MHZ
No ratings yet
Triple-Band Combiner: 790 - 960 MHZ 1710 - 2180 MHZ 2490 - 2690 MHZ
2 pages
Notes On LonWorks-1
No ratings yet
Notes On LonWorks-1
8 pages
Chapter 4 - Communication
No ratings yet
Chapter 4 - Communication
62 pages