第22回ロボット聴覚特集 - 奥乃研究室 - 京都大学

More documents

Recommendations

Info

Fig.15 Sound spots on low frequency人の声などを出力する時は, 低周波数帯域の信号を除いても内容を理解することができるので,ハイパスフィルタで低周波信号成分を取り除くなどが考えられる.7. おわりにシミュレーションと実測から,スピーカを平面に並べることで, 音をビーム状ではなくスポット状に近い形 ( 楕円状 )に制御できることを確認した.また,今回スピーカの数を大幅に増やしたことで, 焦点位置とそのほかの位置で 15dB の音圧差を形成することができた.しかし, 低周波数帯域では音が収束せず,明確にサウンドスポットを作ることができなかった.高周波数帯域 ( 今回のシステムでは 2000Hz 以上 )では,音源間隔に比べて波長が小さくなることから, 焦点以外でも高音圧の領域が現れた. 今後は, 高周波数帯域と低周波数帯域の音場の制御を工夫することが課題である.謝辞東京理科大学の雨宮豊氏と玉井裕樹氏 , 産業技術総合研究所の高野太刀雄氏 ,R-Lab 社の長嶋功一氏と椛澤光隆氏は,本論文で述べたスピーカアレイの開発に多大な貢献をした. 東京理科大学の酒谷広太 , 大嶋聖人 , 林宏樹 , 三竹伸生各氏は音圧測定に貢献をした. 本研究の一部は科学技術振興機構さきがけ研究 21 の, 一部は文部科学省科学研究費補助金の支援により行われた. 記して謝意を表す.参考文献1) 吉川茂 , 藤田肇 ; 基礎音響学 , 講談社サイエンティフィック(2002).2) HARRY F.OLSON,P H D.Acoustic Sound 翻訳西巻正朗 , 森司 , 古川誠二郎 , 近藤巌 , 横山巧 : 無線従事者教育協会 (1959).3) 城戸健一 , 曽根敏夫 , 柴山幹夫 , 山口公典 , 中鉢憲賢 :日本音響学会講座 1 基礎音響工学 ,コロナ社 (1977).4) 二村田忠元 , 奥田襄介 , 城戸健一 , 曽根敏夫 : 電気音響工学 ,オーム社 (1963).5) 中村健太郎 : 音の仕組み,ナツメ社 (1999).6) 玉井裕樹 , 加賀美聡 , 溝口博 , 長島功一 , 高野太刀雄 :超多チャンネルスピーカアレイによるサウンドスポット形成の動特性評価 , 日本機械学会ロボティクス・メカトロニクス講演会 ‘04 講演論文集pp.1P1-H-58(1)-(4)(2004).7) 雨宮豊 , 玉井裕樹 , 加賀美聡 , 溝口博 , 長島功一 , 高野太刀雄 : 超多チャンネルマイクアレイによる生活環境下での2 次元音源定位 , 日本機械学会ロボティクス・メカトロニクス講演会 ‘04 講演論文集pp.1P1-H-58(1)-(4)(2004).8) 石綿陽一 , 松井俊浩 , 國吉康夫 :, 高度な実時間処理機能を持つ Linux の開発 , 第 16 回日本ロボット学会学術講演会予稿集 ,p335-356(1998).9) 雨宮豊 , 玉井裕樹 , 溝口博 , 加賀美聡 , 長島功一 , 高野太刀雄 : 超多チャンネルマイクアレイによる生活環境下での 2 次元音源定位 , 日本機械学会ロボティクス・メカトロニクス講演会 ‘04 講演論文集pp.1P1-H-58(1)-(4) (2004).10) 中島平太郎 :デジタルオーディオ読本 ,オーム社 ,(1991).11) 坂巻佳壽美 :デジタル信号処理 , 工業調査会 (1998).12) 臼井支郎 , 船田哲男 , 梅崎太造 , 戸田尚宏 , 萩原克行 ,横田康成 , 輿水大和 :インターユニバーシティー信号解析 ,オーム社 (1991).13) 石田義久 , 鎌田弘之 : デジタル信号処理のポイント,産業図書 (1989).14) 足立修一 :MATLAB によるデジタル信号とシステム, 東京電気大学出版局 (2002).100
社団法人人工知能学会Japanese Society forArtificial Intelligence人工知能学会JSAI Technical ReportSIG-CHallege-0522-17 (10/15)Evaluation of Missing Feature Theory Based Automatic Speech Recognitionfor Simultaneous Speech Sentences 1 Jean-Marc Valin 2 3 3 3 1 1 1Shunichi YAMAMOTO 1 ,Jean-MarcVALIN 2 , Kazuhiro NAKADAI 3 ,HiroshiTSUJINO 3 ,Kazunori KOMATANI 1 ,TetsuyaOGATA 1 ,andHiroshiG.OKUNO 11 , Graduate School of Informatics, Kyoto University2 Dept. of Electrical Engineering and Computer Engineering, Université de Sherbrooke3 () , Honda Research Institute Japan, Co., Ltd.{shunichi,komatani,ogata,okuno}@kuis.kyoto-u.ac.jp, jean-marc.valin@usherbrooke.ca,{nakadai,nakano,tsujino}@jp.honda-ri.comAbstractA robot in the real world usually hears mixturesof sounds. To achieve such a robot audition system,the integrtion of sound source separation(SSS) and automatic speech recognition (ASR)is necessary. We propose to use the missingfeature theory (MFT) as an interface with highinteroperability for the integration. The mainadvantage of this approach resides in the factthat the ASR with a clean acoustic model canadapt the distortion of separated speech by consultinga missing feature mask (MFM). In ourMFT-based robot audition system, we developeda microphone array SSS system to outputseparated speech with a MFM generated withoutany prior knowledge, and we used MultibandJulius, which supported stochastic languagemodels and recognized speech fast, forthe MFT-based ASR to recognize the separatedspeechbyusingtheMFM.Weevaluatethe robot audition system working with the humanoidSIG2. As a result, we showed the improvementin word correct rates and processingspeed through speech recognition of the mixturesof three sentences as well as those of threeisolated words.1 AURORA [1, 2] HMM [3, 4] 8ch [5] [6] 2 101
Page 4:
SCOT(Smoothed Coherence Transform)P
Page 8 and 9:
Particle (a)(b)φ12(τ )[14]x ( t )
Page 10 and 11:
- 8 -
Page 12 and 13:
1 () 2 SIMO-ICA 3 SIMO-ICA tele
Page 14:
ICAy FCy FCy SIMO-ICAs 1(t)x 1(t)1(
Page 17 and 18:
[15] Y. Mori, H. Saruwatari, T. Tak
Page 19 and 20:
社団法人人工知能学
Page 21 and 22:
• 音源位置マイク配置
Page 23 and 24:
Table 1: 6 : SIR (dB)SIR 1 SIR 2 S
Page 25 and 26:
Page 27 and 28:
SIMO-ICA SIMO Figure 2(a)SIMO-ICA
Page 29 and 30:
Binary maskConventional ICAConventi
Page 31 and 32:
Page 33 and 34:
k lo (l), k c (l), k hi (l) l k c
Page 35 and 36:
5.75 m4.33 mNoise1.15 mUser 40°2.1
Page 38 and 39:
おける方法論に関し
Page 40 and 41:
Fig.6 は幼児の ABR (Auditory
Page 42 and 43:
ンターフェースはスパイ
Page 44 and 45:
マイクロホン[ 正面 ][ 左
Page 46 and 47:
s(k)Crosstalkn(k)R S(k)X P(k)X R(k)
Page 48 and 49:
する隠れマルコフモデル
Page 50 and 51:
123ÙÖ ½ ¾º¾ ´º ½µ ´º
Page 52 and 53: ÌÐ ½ ¿º¾ ÅÎÆÇÂ
Page 54 and 55: ÁÒØÖÒØÓÒÐ ÓÒÖÒ ÓÒ Á
Page 56 and 57: 例えば、同一時間差
Page 58 and 59: いて、θの絶対値が大
Page 60 and 61: Fig.11 にこのシステムの処
Page 62 and 63: 5 , 2 EMIEWFig.1 EMIEW EMIEW 6 ,
Page 64: 0 P th , (14) 4.4 3 4 4 1 , 3
Page 67 and 68: 社団法人人工知能学
Page 69 and 70: 3.1. 3.2. Fig. 3. The
Page 71 and 72: 4.1. Fig. 5. The time co
Page 75 and 76: modal (m, ), whispery (w, ), aspir
Page 77 and 78: Aperiodicity rate (APR)TLR (Time-La
Page 81 and 82: , À, WDS-BF Ñ À℄·
Page 83 and 84: Table 1: Localization Error of A Si
Page 87 and 88: を行い, 閾値処理を
Page 89 and 90: 4. 音声対話制御実験H
Page 93 and 94: 3 HLDAMLLR [3] (Useful Information
Page 95 and 96: Class 10degClass 20degClass 10degCl
Page 99 and 100: 赤い長方形内 ). 以下
Page 101: 5.2 音場計測結果(dB SPL)
Page 105 and 106: a) 90 b) 90 MFMc) d) MFMe) 9
Page 107 and 108: (3) MFT Julius 7.1 Figure 4: SIG2
show all

第22回 ロボット聴覚特集 - 奥乃研究室 - 京都大学

Create successful ePaper yourself

Delete template?

Save as template?

第22回ロボット聴覚特集 - 奥乃研究室 - 京都大学