DOI: 10.2298/CSIS100906032Z

Voice Activity Detection Method Based on Multi-valued Coarse-graining Lempel-Ziv Complexity

Huan Zhao1, Gangjin Wang1, Cheng Xu1 and Fei Yu2

  1. School of Information Science and Engineering, Hunan University,
    410082 Changsha, P. R. China
    hzhao@hnu.edu.cn, wgangjin@163.com, chengxu@public.cs.hn.cn
  2. Jiangsu Provincial Key Laboratory of Computer Information Processing Technology,
    215000 Suzhou, P. R. China
    hunanyufei@126.com

Abstract

One of the key issues in practical speech processing is to locate precisely endpoints of the input utterance to be free of non-speech regions. Although lots of studies have been performed to solve this problem, the operation of existing voice activity detection (VAD) algorithms is still far away from ideal. This paper proposes a novel robust feature for VAD method that is based on multi-valued coarse-graining Lempel-Ziv Complexity (MLZC), which is an improved algorithm of the binary coarse-graining Lempel-Ziv Complexity (BLZC). In addition, we use fuzzy c-Means clustering algorithm and the Bayesian information criterion algorithm to estimate the thresholds of the MLZC characteristic, and adopt the dual-thresholds method for VAD. Experimental results on the TIMIT continuous speech database show that at low SNR environments, the detection performance of the proposed MLZC method is superior to the VAD in GSM ARM, G.729 and BLZC method.

Key words

speech processing, voice activity detection, Lempel-Ziv complexity, multi-valued coarse-graining, fuzzy c-Means clustering algorithm, Bayesian information criterion algorithm

Digital Object Identifier (DOI)

https://doi.org/10.2298/CSIS100906032Z

Publication information

Volume 8, Issue 3 (June 2011)
Year of Publication: 2011
ISSN: 1820-0214 (Print) 2406-1018 (Online)
Publisher: ComSIS Consortium

Full text

DownloadAvailable in PDF
Portable Document Format

How to cite

Zhao, H., Wang, G., Xu, C., Yu, F.: Voice Activity Detection Method Based on Multi-valued Coarse-graining Lempel-Ziv Complexity. Computer Science and Information Systems, Vol. 8, No. 3, 869-888. (2011), https://doi.org/10.2298/CSIS100906032Z