Study of best algorithm combinations for speech processing tasks in machine learning using median vs. mean clusters in MARF
Source: C3S2E; Vol. 290
Proceedings of the 2008 C3S2E conference
Montreal, Quebec, Canada
SESSION: Algorithms
Pages 29-43  
Year of Publication: 2008
ISBN: 978-1-60558-101-9
Author
Serguei A. Mokhov  Concordia University, Montreal, Quebec, Canada
Sponsors
ACM International Conference Proceedings Series
Concordia University
BytePress
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1370256.1370262

ABSTRACT

This work reports experimental results, and their analysis, for several speech processing tasks performed with SpeakerIdentApp, a text-independent speaker identification application built on the API and implementation of the Modular Audio Recognition Framework (MARF). For each task it identifies the best of the available algorithm configurations when median clusters are used instead of the default mean clusters. The study focuses on identifying speakers, their gender, and their accent through machine learning. It complements two preceding statistical studies that used only mean clusters, and shows how the selection of the best algorithm combinations differs under the median cluster approach. To the author's knowledge, no comprehensive study of this kind had been done before.
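
The difference between mean and median clusters can be illustrated with a minimal sketch. It is not MARF's API or the paper's implementation. The feature vectors, the speaker labels, and the helper names (mean_cluster, median_cluster, classify) are hypothetical, and a plain Euclidean nearest-cluster rule is assumed for the comparison. Each speaker's training vectors are collapsed into one cluster vector, taken per component either as the mean or as the median, and a test sample is assigned to the closest cluster.

```python
from statistics import mean, median


def mean_cluster(vectors):
    """Collapse training vectors into one vector of per-component means."""
    return [mean(column) for column in zip(*vectors)]


def median_cluster(vectors):
    """Collapse training vectors into one vector of per-component medians."""
    return [median(column) for column in zip(*vectors)]


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def classify(sample, clusters):
    """Return the label whose cluster vector is nearest to the sample."""
    return min(clusters, key=lambda label: euclidean(sample, clusters[label]))


# Hypothetical two-dimensional feature vectors, invented for illustration.
# The third vector of speaker_a is a deliberate outlier.
training = {
    "speaker_a": [[1.0, 2.0], [1.2, 2.1], [12.0, 2.0]],
    "speaker_b": [[4.0, 2.0], [4.2, 2.2], [3.8, 1.8]],
}

mean_clusters = {label: mean_cluster(v) for label, v in training.items()}
median_clusters = {label: median_cluster(v) for label, v in training.items()}

sample = [1.5, 2.0]
print("mean clusters:  ", classify(sample, mean_clusters))    # speaker_b
print("median clusters:", classify(sample, median_clusters))  # speaker_a
```

In this toy data the outlier pulls the mean cluster of speaker_a away from the sample, so the two cluster types give different answers. This only shows why the choice of cluster type can change which configuration performs best; it does not reproduce any result from the study.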

