File
Authors
Hanada, Junichi
Keywords
speech separation
auditory scene analysis
sequential processing model
NDC
Technology. Engineering
Abstract
Speech separation based on auditory scene analysis (ASA) has been widely studied. We propose a sequential processing model of computational ASA (CASA), in which a mixed speech is sequentially decomposed into frequency signals using modified Discrete Fourier Transform (DFT), four features in ASA are extracted from the decomposed frequency signals, the frequency signals are regrouped by examining the extracted features, and each separated speech is obtained by recomposing the frequency signals in a group. In this paper, we attempt to separate speeches only using the harmonic structure, which is one of the features and regarded as the backbone in our sequential implementation model.
Publisher
IEEE
Content Type
Conference Paper
Link
https://ieeexplore.ieee.org/abstract/document/7432750/
ISSN・ISBN
978-1-4244-5015-2
Journal Title
Proceedings of International Symposium on Intelligent Signal Processing and Communication Systems 2015 (ISPACS2015)
Current Journal Title
Proceedings of International Symposium on Intelligent Signal Processing and Communication Systems 2015 (ISPACS2015)
Start Page
124
End Page
128
Published Date
2015
DOI
Text Version
Author
Rights
© © 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Citation
Proceedings of International Symposium on Intelligent Signal Processing and Communication Systems 2015 (ISPACS2015). 2015, 124-128
Department
Faculty of Engineering/Graduate School of Engineering
Language
English