Michael W. Mahoney

Bibliometrics: publication history
Publication years: 2005-2009
Publication count: 18
Citation count: 63
Available for download: 10
Downloads (6 Weeks): 194
Downloads (12 Months): 1,496


2009
1
Empirical Evaluation of Graph Partitioning Using Spectral Embeddings and Flow
Kevin J. Lang, Michael W. Mahoney, Lorenzo Orecchia
June 2009
SEA '09: Proceedings of the 8th International Symposium on Experimental Algorithms
Publisher: Springer-Verlag
 Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0

We present initial results from the first empirical evaluation of a graph partitioning algorithm inspired by the Arora-Rao-Vazirani algorithm of [5], which combines spectral and flow methods in a novel way. We have studied the parameter space of this ...

 
2
An improved approximation algorithm for the column subset selection problem
Christos Boutsidis, Michael W. Mahoney, Petros Drineas
January 2009
SODA '09: Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms
Publisher: Society for Industrial and Applied Mathematics
Full text available: Pdf (423.87 KB)
 Bibliometrics:  Downloads (6 Weeks): 8,   Downloads (12 Months): 102,   Citation Count: 0

We consider the problem of selecting the "best" subset of exactly k columns from an m x n matrix A. In particular, we present and analyze a novel two-stage algorithm that runs in O(min{mn^2, m^2n}) ...

 
2008
3
Algorithmic and statistical challenges in modern large-scale data analysis are the focus of MMDS 2008
Michael W. Mahoney, Lek-Heng Lim, Gunnar E. Carlsson
December 2008
SIGKDD Explorations Newsletter, Volume 10 Issue 2
Publisher: ACM
Full text available: Pdf (184.56 KB)
 Bibliometrics:  Downloads (6 Weeks): 8,   Downloads (12 Months): 27,   Citation Count: 0

We provide a report for the ACM SIGKDD community about the 2008 Workshop on Algorithms for Modern Massive Data Sets (MMDS 2008), its origin in MMDS 2006, and future directions for this interdisciplinary research area.

 
4
Tensor-CUR Decompositions for Tensor-Based Data
Michael W. Mahoney, Mauro Maggioni, Petros Drineas
September 2008
SIAM Journal on Matrix Analysis and Applications, Volume 30 Issue 3
Publisher: Society for Industrial and Applied Mathematics
 Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 1

Motivated by numerous applications in which the data may be modeled by a variable subscripted by three or more indices, we develop a tensor-based extension of the matrix CUR decomposition. The tensor-CUR decomposition is most relevant as a data analysis ...

Keywords: CUR decomposition, hyperspectral imagery, recommendation system, tensor decomposition
 
5
Unsupervised feature selection for principal components analysis
Christos Boutsidis, Michael W. Mahoney, Petros Drineas
August 2008
KDD '08: Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Publisher: ACM
Full text available: Pdf (429.35 KB)
 Bibliometrics:  Downloads (6 Weeks): 28,   Downloads (12 Months): 396,   Citation Count: 3

Principal Components Analysis (PCA) is the predominant linear dimensionality reduction technique, and has been widely applied on datasets in all scientific domains. We consider, both theoretically and empirically, the topic of unsupervised feature selection ...

Keywords: PCA, random sampling, subset selection
 
6
Relative-Error CUR Matrix Decompositions
Petros Drineas, Michael W. Mahoney, S. Muthukrishnan
May 2008
SIAM Journal on Matrix Analysis and Applications, Volume 30 Issue 2
Publisher: Society for Industrial and Applied Mathematics
 Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 2

Many data analysis applications deal with large matrices and involve approximating the matrix using a small number of “components.” Typically, these components are linear combinations of the rows and columns of the matrix, and are thus difficult ...

Keywords: CUR matrix decomposition, approximate least squares, data analysis, random sampling algorithms
 
7
Sampling subproblems of heterogeneous Max-Cut problems and approximation algorithms
Petros Drineas, Ravi Kannan, Michael W. Mahoney
May 2008
Random Structures & Algorithms, Volume 32 Issue 3
Publisher: John Wiley & Sons, Inc.
 Bibliometrics:  Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0

Recent work in the analysis of randomized approximation algorithms for NP-hard optimization problems has involved approximating the solution to a problem by the solution of a related subproblem of constant size, where the subproblem is constructed ...

Keywords: CUR matrix decomposition, Max-Cut algorithm, Sampling Linear Programs
 
8
Statistical properties of community structure in large social and information networks
Jure Leskovec, Kevin J. Lang, Anirban Dasgupta, Michael W. Mahoney
April 2008
WWW '08: Proceeding of the 17th international conference on World Wide Web
Publisher: ACM
Full text available: Pdf (884.00 KB)
 Bibliometrics:  Downloads (6 Weeks): 57,   Downloads (12 Months): 384,   Citation Count: 11

A large body of work has been devoted to identifying community structure in networks. A community is often thought of as a set of nodes that has more connections between its members than to the remainder of the network. In this paper, we characterize ...

Keywords: community structure, conductance, graph partitioning, random walks, social networks
 
9
Sampling algorithms and coresets for ℓp regression
Anirban Dasgupta, Petros Drineas, Boulos Harb, Ravi Kumar, Michael W. Mahoney
January 2008
SODA '08: Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms
Publisher: Society for Industrial and Applied Mathematics
Full text available: Pdf (401.50 KB)
 Bibliometrics:  Downloads (6 Weeks): 3,   Downloads (12 Months): 37,   Citation Count: 2

The ℓp regression problem takes as input a matrix A ∈ ℝ^{n×d}, a vector b ∈ ℝ^n, and a number p ∈ [1, ∞), and it returns as output a number ...

 
2007
10
Feature selection methods for text classification
Anirban Dasgupta, Petros Drineas, Boulos Harb, Vanja Josifovski, Michael W. Mahoney
August 2007
KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Publisher: ACM
Full text available: Mov (19:55 MIN), Pdf (1.02 MB)
 Bibliometrics:  Downloads (6 Weeks): 53,   Downloads (12 Months): 338,   Citation Count: 2

We consider feature selection for text classification both theoretically and empirically. Our main result is an unsupervised feature selection strategy for which we give worst-case theoretical guarantees on the generalization power of the resultant classification ...

Keywords: feature selection, random sampling, regularized least squares classification, text classification
 