Downloads: 411
Files in this item:
File | Description | Size | Format | |
---|---|---|---|---|
soa023_044.pdf | | 644.76 kB | Adobe PDF | View/Open |
Title: | A phonetic Vocoder for Very-Low-Rate Speech Coding |
Authors: | Nakagawa, Seiichi; Hirata, Yoshimitsu |
Alternative author names: | ナカガワ, セイイチ; ヒラタ, ヨシミツ |
Issue date: | 1989 |
Publisher: | INSTITUTION FOR PHONETIC SCIENCES UNIVERSITY OF KYOTO |
Journal title: | 音声科学研究 |
Volume: | 23 |
Start page: | 44 |
End page: | 56 |
Abstract: | In this paper, we describe a phonetic vocoder based on the concatenation of syllable units, which represents speech waves at an extremely low rate (100 bits/s) using a speech recognition technique. We adopt the syllable as the unit of recognition and synthesis because a syllable captures the coarticulation effect between a consonant and a vowel. Speech waves are transformed into a sequence of frames, each of which consists of LPC cepstrum, PARCOR coefficients, pitch and power. After O(n) DP matching against 500 reference patterns, the input speech is transformed into a sequence of Japanese syllables. The information for each recognized syllable comprises the syllable category, duration, power and pitch, and is represented by 16 bits. Using this vocoder, speech can be represented by only 100 bits/s, and phrase intelligibility for an unlimited task is about 60%. If the number of reference patterns is enlarged to, say, 1600, the intelligibility exceeds 70%. In this case, the coding rate is about 112 bits/s. |
URI: | http://hdl.handle.net/2433/52492 |
Appears in collections: | Vol.23 |
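
The coding-rate figures quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is only one reading that happens to be consistent with those figures, not the authors' stated derivation. It assumes (these assumptions are not in the abstract) that the syllable category is sent as a fixed-length index into the reference set, that the remaining bits of each 16-bit syllable record are unchanged when the reference set grows, and that the syllable rate stays the same. The names `category_bits` and `syllable_rate` are hypothetical helpers introduced for illustration.

```python
# Assumed reading of the abstract's numbers; the bit layout is NOT given there.

def category_bits(num_patterns: int) -> int:
    """Minimum whole bits for a fixed-length index over num_patterns entries."""
    return (num_patterns - 1).bit_length()


def syllable_rate(bit_rate: float, bits_per_syllable: int) -> float:
    """Syllables per second implied by a bit rate and a per-syllable bit budget."""
    return bit_rate / bits_per_syllable


BITS_PER_SYLLABLE = 16   # from the abstract
BIT_RATE = 100.0         # bits/s, from the abstract

rate = syllable_rate(BIT_RATE, BITS_PER_SYLLABLE)          # implied syllables/s
extra_bits = category_bits(1600) - category_bits(500)      # assumed index growth
estimated_rate = rate * (BITS_PER_SYLLABLE + extra_bits)   # bits/s, 1600 patterns

print(f"implied syllable rate: {rate} syllables/s")
print(f"index bits: {category_bits(500)} (500 patterns), "
      f"{category_bits(1600)} (1600 patterns)")
print(f"estimated rate with 1600 patterns: {estimated_rate} bits/s")
```

Under these assumptions the estimate lands close to the "about 112 bits/sec" stated in the abstract, which suggests the increase may come mainly from the larger category index.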
All items held in this repository are protected by copyright.