Analyzing temporal properties of speech trajectory using
graph structures towards speech recognition
Dublin Core
Title
Analyzing temporal properties of speech trajectory using
graph structures towards speech recognition
graph structures towards speech recognition
Subject
Graph eigenvalues
Graph signal processing
Speech analysis
Structural processing
Speech trajectory
Graph signal processing
Speech analysis
Structural processing
Speech trajectory
Description
Speech signal analysis aims to identify patterns within data to develop effective
recognition algorithms. This process primarily utilizes feature extraction
techniques such as linear predictive coding (LPC), linear predictive cepstral coefficients
(LPCCs), and Mel-frequency cepstral coefficients (MFCCs). These
features are crucial for constructing recognition algorithms that leverage both
statistical and deep learning methods. While deep learning models require extensive
datasets, they often prove unsuitable for low-resource languages. The
Hidden Markov model (HMM) is the most widely adopted statistical framework
in speech processing. However, HMMs are characterized by state-dependent
models, where each state interacts only with its neighboring states. This limitation
restricts HMMs from capturing long-term signal properties, highlighting the
need for addressing these constraints at the feature extraction stage. Most feature
extraction methods rely on short-term signal processing, which further limits
the comprehension of speech utterances. To overcome these limitations, alternative
methods are necessary to capture more comprehensive patterns. This paper
presents a graph-based approach for analyzing speech trajectories and their
temporal properties, which are subsequently validated using HMMs in speech
recognition tasks. Graph-based representations on a low-resource Telugu dataset
improve recognition accuracy by 13% while reducing processing time compared
to traditional LPC.
recognition algorithms. This process primarily utilizes feature extraction
techniques such as linear predictive coding (LPC), linear predictive cepstral coefficients
(LPCCs), and Mel-frequency cepstral coefficients (MFCCs). These
features are crucial for constructing recognition algorithms that leverage both
statistical and deep learning methods. While deep learning models require extensive
datasets, they often prove unsuitable for low-resource languages. The
Hidden Markov model (HMM) is the most widely adopted statistical framework
in speech processing. However, HMMs are characterized by state-dependent
models, where each state interacts only with its neighboring states. This limitation
restricts HMMs from capturing long-term signal properties, highlighting the
need for addressing these constraints at the feature extraction stage. Most feature
extraction methods rely on short-term signal processing, which further limits
the comprehension of speech utterances. To overcome these limitations, alternative
methods are necessary to capture more comprehensive patterns. This paper
presents a graph-based approach for analyzing speech trajectories and their
temporal properties, which are subsequently validated using HMMs in speech
recognition tasks. Graph-based representations on a low-resource Telugu dataset
improve recognition accuracy by 13% while reducing processing time compared
to traditional LPC.
Creator
Parabattina Bhagath1, Malempati Shanmukha2, Gnana Nagasri Puthi2
Source
Journal homepage: http://journal.uad.ac.id/index.php/TELKOMNIKA
Date
Sep 10, 2025
Contributor
PERI IRAWAN
Format
PDF
Language
ENGLISH
Type
TEXT
Files
Collection
Citation
Parabattina Bhagath1, Malempati Shanmukha2, Gnana Nagasri Puthi2, “Analyzing temporal properties of speech trajectory using
graph structures towards speech recognition,” Repository Horizon University Indonesia, accessed January 12, 2026, https://repository.horizon.ac.id/items/show/10375.
graph structures towards speech recognition,” Repository Horizon University Indonesia, accessed January 12, 2026, https://repository.horizon.ac.id/items/show/10375.