A Comparison of Different Approaches to Automatic Speech Segmentation
Kris Demuynck and Tom Laureys
K.U.Leuven ESAT/PSI
Kasteelpark Arenberg 10
B-3001 Leuven, Belgium
{kris.demuynck,tom.laureys}@esat.kuleuven.ac.be
http://www.esat.kuleuven.ac.be/~spch
Abstract. We compare different methods for obtaining accurate speech
segmentations starting from the corresponding orthography. The complete
segmentation process can be decomposed into two basic steps. First,
a phonetic transcription is automatically produced with the help of large
vocabulary continuous speech recognition (LVCSR). Then, the phonetic
information and the speech signal serve as input to a speech segmentation
tool. We compare two automatic approaches to segmentation, based on
the Viterbi and the Forward-Backward algorithm, respectively. Further,
we develop different techniques to cope with biases between automatic
and manual segmentations. Experiments were performed to evaluate the
generation of phonetic transcriptions as well as the different speech
segmentation methods.
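
To make the second step concrete, the following is a minimal sketch of
Viterbi-style alignment of a known phone sequence to speech frames. It is a
toy illustration under stated assumptions, not the system described in this
paper: the function name viterbi_align, the input matrix loglik and the
example scores are all hypothetical, and a real segmenter would use HMM
acoustic models with several states per phone and transition probabilities.

```python
import math


def viterbi_align(loglik):
    """Align T frames to N phones that must occur in the given order.

    loglik[t][j] is an assumed, precomputed log-likelihood of frame t
    under the j-th phone of the transcription. Every phone receives at
    least one frame. Returns the start frame of each phone.
    """
    num_frames, num_phones = len(loglik), len(loglik[0])
    if num_frames < num_phones:
        raise ValueError("need at least one frame per phone")

    neg_inf = -math.inf
    score = [[neg_inf] * num_phones for _ in range(num_frames)]
    back = [[0] * num_phones for _ in range(num_frames)]
    score[0][0] = loglik[0][0]  # the alignment must start in the first phone

    for t in range(1, num_frames):
        for j in range(num_phones):
            stay = score[t - 1][j]                         # remain in phone j
            move = score[t - 1][j - 1] if j > 0 else neg_inf  # enter phone j
            if move > stay:
                best, back[t][j] = move, j - 1
            else:
                best, back[t][j] = stay, j
            score[t][j] = best + loglik[t][j]

    # Backtrace from the last phone at the last frame.
    path = [num_phones - 1]
    for t in range(num_frames - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()

    # A phone boundary lies wherever the aligned phone index changes.
    return [t for t in range(num_frames) if t == 0 or path[t] != path[t - 1]]


# Hypothetical scores: 6 frames, 3 phones; 0 marks the best-matching phone.
example = [
    [0, -5, -5],
    [0, -5, -5],
    [-5, 0, -5],
    [-5, 0, -5],
    [-5, 0, -5],
    [-5, -5, 0],
]
starts = viterbi_align(example)
print(starts)
```

The sketch keeps only the single best path, which is what distinguishes a
Viterbi alignment from a Forward-Backward computation: the latter sums over
all admissible alignments rather than taking the maximum at each step.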
1 Introduction
In this paper we investigate the development of an accurate speech
segmentation system for the Spoken Dutch Corpus project. Speech segmentations,
at the phoneme (e.g. TIMIT) or word level (e.g. Switchboard, CGN), have become
a standard annotation in speech corpora. Corpus users can benefit from the fact
that the segmentation couples the speech signal to the other annotation layers
(orthography, phonetics) by means of time stamps, thus providing easy access
to audio fragments in the corpus. For the speech technologist, segmentations
are indispensable for the initial training of acoustic ASR models, the
development of TTS systems and speech research in general.
Some speech corpora only provide automatic segmentations, obviously
requiring an accurate segmentation algorithm. In other corpora speech
segmentations are checked manually. The latter case requires a high-quality
automatic segmentation system as well, since a bet