22nd Meeting: Special Issue on Robot Audition - Okuno Laboratory - Kyoto University

Japanese Society for Artificial Intelligence (JSAI)
JSAI Technical Report SIG-Challenge-0522-5 (10/14)

Hands-Free Speech Recognition Using Spatial Subtraction Array with Adaptive Noise Estimation Processing under Real Environment

Chie Kiuchi, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano
Nara Institute of Science and Technology
chie-k@is.naist.jp

Abstract

We propose an improved spatial subtraction array (SSA) with adaptive noise estimation processing, aimed at robust hands-free speech recognition in real environments. The previously proposed SSA can recognize target speech with high accuracy in a laboratory environment. However, the conventional SSA uses an ideally designed null beamformer (NBF) for noise estimation, and consequently cannot take into account the reverberation that arises in an actual environment. The proposed SSA introduces an adaptive beamformer (ABF) for accurate noise estimation, and thereby remarkably improves noise subtraction performance even under real reverberant conditions. Speech recognition experiments reveal that the word accuracy of the proposed SSA is superior to that of the conventional SSA, as well as to that of the conventional delay-and-sum beamformer and adaptive beamformer.

1 [Japanese body text of this section is not recoverable from the extraction. Terms and citations that survive: SSA [1]; Delay-and-Sum (DS) [2]; Griffiths-Jim (GJ) [2],[3]; Mel Frequency Cepstrum Coefficient (MFCC) [4]; null beamformer (NBF) [5]; NBF SSA [1]; ABF SSA.]
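The structure outlined in the abstract (a primary path that enhances the target, a reference path that estimates the noise, and a subtraction in the power domain) can be illustrated with a minimal sketch for a single frequency bin. This is not the authors' implementation: the two-microphone difference used for the reference path is a simple fixed null standing in for the NBF, the adaptive (ABF) estimator is not modelled, and all function names as well as the `beta` and `floor` parameters are hypothetical choices made for illustration.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def steering_phase(freq_hz, mic_pos_m, angle_rad):
    """Far-field phase at one microphone of a linear array (angle 0 = broadside)."""
    delay_s = mic_pos_m * math.sin(angle_rad) / SPEED_OF_SOUND
    return cmath.exp(-2j * math.pi * freq_hz * delay_s)


def delay_and_sum(mic_bins, freq_hz, mic_pos_m, look_rad):
    """Primary path: phase-align every channel to the look direction, then average."""
    aligned = [
        x * steering_phase(freq_hz, pos, look_rad).conjugate()
        for x, pos in zip(mic_bins, mic_pos_m)
    ]
    return sum(aligned) / len(aligned)


def fixed_null(mic_bins, freq_hz, mic_pos_m, look_rad):
    """Reference path: half the difference of the first two aligned channels.

    A source exactly in the look direction cancels, so what remains serves as
    a noise estimate. This fixed design is a stand-in for the NBF and does not
    adapt to reverberation.
    """
    a0 = steering_phase(freq_hz, mic_pos_m[0], look_rad).conjugate()
    a1 = steering_phase(freq_hz, mic_pos_m[1], look_rad).conjugate()
    return (mic_bins[0] * a0 - mic_bins[1] * a1) / 2


def spatial_subtraction(primary, noise_ref, beta=1.0, floor=0.01):
    """Subtract the weighted noise power from the primary power, with a floor.

    beta (subtraction weight) and floor (fraction of the primary power kept
    as a lower bound) are hypothetical tuning parameters.
    """
    primary_power = abs(primary) ** 2
    noise_power = abs(noise_ref) ** 2
    return max(primary_power - beta * noise_power, floor * primary_power)


if __name__ == "__main__":
    mics = [0.0, 0.05]        # microphone positions in metres (assumed)
    freq = 1000.0             # frequency of the analysed bin in Hz (assumed)
    # Interference arriving from 90 degrees while the array looks at 0 degrees.
    noise = [steering_phase(freq, m, math.pi / 2) for m in mics]
    primary = delay_and_sum(noise, freq, mics, 0.0)
    reference = fixed_null(noise, freq, mics, 0.0)
    print(abs(primary) ** 2, spatial_subtraction(primary, reference))
```

Because the subtraction acts on power spectra and discards phase, the output suits a recognizer front end that works from spectral magnitudes, such as the MFCC features mentioned in the text, more than waveform reconstruction. Replacing `fixed_null` with an adaptively estimated reference would correspond to the ABF-based variant the abstract proposes.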
