- 1、本文档共5页,可阅读全部内容。
- 2、有哪些信誉好的足球投注网站(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
Robust speech activity detection in the presence of noise
Robust Detection of Speech Activity
in the Presence of Noise
Ruhi Sarikaya, John H.L. Hansen
Robust Speech Processing Laboratory
Center for Spoken Language Research
University of Colorado Boulder, Campus Box 594
(Express Mail: 3215 Marine Street, Room E-265)
Boulder, Colorado 80309-0594
303 – 735 –5148 (Phone) 303 – 735 – 5072 (Fax)
/
John.Hansen@, (email)
R
RSPL
C
S
L
R
ICSLP-98: Inter. Conf. On Spoken Language
Processing,
Sydney, Australia, Nov. 30 – Dec. 4, 1998.
R. Sarikaya, J.H.L. Hansen, Robust detection of Speech Activity in the Presence
of Noise, ICSLP-98: Inter. Conf. on Spoken Language Processing, vol. 4, pp.
1455-1458, Sydney, Australia, Dec. 1998.
ROBUST SPEECH ACTIVITY DETECTION IN THE PRESENCE OF NOISE
Ruhi Sarikaya and John H. L. Hansen
Robust Speech Processing Laboratory
Duke University, Box 90291, Durham, NC 27708-0291
/Research/Speech ruhi@ jhlh@ee.duke
ABSTRACT
This study presents a new approach for robust speech ac-
tivity detection (SAD). Our framework is based on HMM
recognition of speech versus silence. We model speech as
one of fourteen large phone classes whereas silence is repre-
sented as a separate model. Individual test utterances are
concatenated to simulate read continuous speech for test-
ing. The HMM-based algorithm is compared to both an
energy based, as well as speech enhancement based, SAD
algorithms for clean, 5 dB and 0 dB SNR levels under
white Gaussian noise (WGN), aircraft cockpit noise (AIR)
and automobile highway noise (HWY). We found that our
algorithm provides lower frame error rates than the other
two methods especially for HWY noise. Unlike other stud-
ies, we evaluate our algorithm on the core test set of the
standard TIMIT database. Hence, results can be used as
benchmarks to evaluate future systems.
1. INTRODUCTION
Speech activity detection (SAD) is one of the fundamental
issues in many speech processing tasks such as continu-
ous speech recognition and speech enhancement. Reliable
您可能关注的文档
- PAPER Particle Swarms for Feature Extraction of Hyperspectral Data.pdf
- Parking brake.pdf
- Parsec-Scale Radio Structure of the Double Active Nucleus of NGC 6240.pdf
- part 7-the biogeochemistry of dissolved organic carbon and its interactions with climate change.pdf
- Part V Prototype Documentation for the Performance Prediction Component Delivery Slip Name.pdf
- Partial diel vertical migrations in pelagic fish.pdf
- Particle Acceleration by Electromagnetic-Dominated Outflows.pdf
- Particle acceleration in rotating and shearing jets from AGN.pdf
- Particle acceleration in solar flares observations versus numerical simulations.pdf
- Particle Acceleration, Magnetic Field Generation, and Emission in Relativistic Shocks.pdf
文档评论(0)