|
ROLE
Author only
AUTHOR PROFILE PAGES (BETA)
Project background
BOOKMARK & SHARE
|
|
|
18 search results
|
Page:
1
2
next
>>
|
|
|
| Export results as:
BibTeX
EndNotes
ACM Ref
|
| 2009
|
1
|
|
Empirical Evaluation of Graph Partitioning Using Spectral Embeddings and Flow
Kevin J. Lang, Michael W. Mahoney, Lorenzo Orecchia
|
|
June 2009
|
|
SEA '09: Proceedings of the 8th International Symposium on Experimental Algorithms
|
|
Publisher: Springer-Verlag
|
|
| Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Citation Count: 0 |
 |
|
We present initial results from the first empirical evaluation of a graph partitioning algorithm inspired by the Arora-Rao-Vazirani algorithm of [5], which combines spectral and flow methods in a novel way. We have studied the parameter space of this ...
|
| |
|
2
|
|
An improved approximation algorithm for the column subset selection problem
Christos Boutsidis, Michael W. Mahoney, Petros Drineas
|
|
January 2009
|
|
SODA '09: Proceedings of the Nineteenth Annual ACM -SIAM Symposium on Discrete Algorithms
|
|
Publisher: Society for Industrial and Applied Mathematics
|
|
Full text available: |
Pdf
(423.87 KB)
|
|
|
| Bibliometrics: Downloads (6 Weeks): 8, Downloads (12 Months): 102, Citation Count: 0 |
 |
|
We consider the problem of selecting the "best" subset of exactly k columns from an m x n matrix A. In particular, we present and analyze a novel two-stage algorithm that runs in O(min{mn2, m2n}) ...
|
| |
|
| 2008
|
3
|
|
Algorithmic and statistical challenges in modern largescale data analysis are the focus of MMDS 2008
Michael W. Mahoney, LekHeng Lim, Gunnar E. Carlsson
|
|
December 2008
|
|
SIGKDD Explorations Newsletter
, Volume 10 Issue 2
|
|
Publisher: ACM
|
|
Full text available: |
Pdf
(184.56 KB)
|
|
|
| Bibliometrics: Downloads (6 Weeks): 8, Downloads (12 Months): 27, Citation Count: 0 |
 |
|
We provide a report for the ACM SIGKDD community about the 2008 Workshop on Algorithms for Modern Massive Data Sets (MMDS 2008), its origin in MMDS 2006, and future directions for this interdisciplinary research area.
|
| |
|
4
|
|
Tensor-CUR Decompositions for Tensor-Based Data
Michael W. Mahoney, Mauro Maggioni, Petros Drineas
|
|
September 2008
|
|
SIAM Journal on Matrix Analysis and Applications
, Volume 30 Issue 3
|
|
Publisher: Society for Industrial and Applied Mathematics
|
|
| Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Citation Count: 1 |
 |
|
Motivated by numerous applications in which the data may be modeled by a variable subscripted by three or more indices, we develop a tensor-based extension of the matrix CUR decomposition. The tensor-CUR decomposition is most relevant as a data analysis ...
Keywords: CUR decomposition, hyperspectral imagery, recommendation system, tensor decomposition
|
| |
|
5
|
|
Unsupervised feature selection for principal components analysis
Christos Boutsidis, Michael W. Mahoney, Petros Drineas
|
|
August 2008
|
|
KDD '08: Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
|
|
Publisher: ACM
|
|
Full text available: |
Pdf
(429.35 KB)
|
|
|
| Bibliometrics: Downloads (6 Weeks): 28, Downloads (12 Months): 396, Citation Count: 3 |
 |
|
Principal Components Analysis (PCA) is the predominant linear dimensionality reduction technique, and has been widely applied on datasets in all scientific domains. We consider, both theoretically and empirically, the topic of unsupervised feature selection ...
Keywords: PCA, random sampling, subset selection
|
| |
|
6
|
|
Relative-Error $CUR$ Matrix Decompositions
Petros Drineas, Michael W. Mahoney, S. Muthukrishnan
|
|
May 2008
|
|
SIAM Journal on Matrix Analysis and Applications
, Volume 30 Issue 2
|
|
Publisher: Society for Industrial and Applied Mathematics
|
|
| Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Citation Count: 2 |
 |
|
Many data analysis applications deal with large matrices and involve approximating the matrix using a small number of “components.” Typically, these components are linear combinations of the rows and columns of the matrix, and are thus difficult ...
Keywords: $CUR$ matrix decomposition, approximate least squares, data analysis, random sampling algorithms
|
| |
|
7
|
|
Sampling subproblems of heterogeneous Max-Cut problems and approximation algorithms
Petros Drineas, Ravi Kannan, Michael W. Mahoney
|
|
May 2008
|
|
Random Structures & Algorithms
, Volume 32 Issue 3
|
|
Publisher: John Wiley & Sons, Inc.
|
|
| Bibliometrics: Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Citation Count: 0 |
 |
|
Recent work in the analysis of randomized approximation algorithms for NP-hard optimization problems has involved approximating the solution to a problem by the solution of a related subproblem of constant size, where the subproblem is constructed ...
Keywords: CUR matrix decomposition, Max-Cut algorithm, Sampling Linear Programs
|
| |
|
8
|
|
Statistical properties of community structure in large social and information networks
Jure Leskovec, Kevin J. Lang, Anirban Dasgupta, Michael W. Mahoney
|
|
April 2008
|
|
WWW '08: Proceeding of the 17th international conference on World Wide Web
|
|
Publisher: ACM
|
|
Full text available: |
Pdf
(884.00 KB)
|
|
|
| Bibliometrics: Downloads (6 Weeks): 57, Downloads (12 Months): 384, Citation Count: 11 |
 |
|
A large body of work has been devoted to identifying community structure in networks. A community is often though of as a set of nodes that has more connections between its members than to the remainder of the network. In this paper, we characterize ...
Keywords: community structure, conductance, graph partitioning, random walks, social networks
|
| |
|
9
|
|
Sampling algorithms and coresets for ℓp regression
Anirban Dasgupta, Petros Drineas, Boulos Harb, Ravi Kumar, Michael W. Mahoney
|
|
January 2008
|
|
SODA '08: Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms
|
|
Publisher: Society for Industrial and Applied Mathematics
|
|
Full text available: |
Pdf
(401.50 KB)
|
|
|
| Bibliometrics: Downloads (6 Weeks): 3, Downloads (12 Months): 37, Citation Count: 2 |
 |
|
The ℓp regression problem takes as input a matrix A ∈ ℝn, a vector b ∈ ℝn, and a number p ∈ [1, ∞), and it returns as output a number ...
|
| |
|
| 2007
|
10
|
|
Feature selection methods for text classification
Anirban Dasgupta, Petros Drineas, Boulos Harb, Vanja Josifovski, Michael W. Mahoney
|
|
August 2007
|
|
KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
|
|
Publisher: ACM
|
|
Full text available: |
Mov
(19:55 MIN),
Pdf
(1.02 MB)
|
|
|
| Bibliometrics: Downloads (6 Weeks): 53, Downloads (12 Months): 338, Citation Count: 2 |
 |
|
We consider feature selection for text classification both theoretically and empirically. Our main result is an unsupervised feature selection strategy for which we give worst-case theoretical guarantees on the generalization power of the resultant classification ...
Keywords: feature selection, random sampling, regularized least squares classification, text classification
|
| |
|
|
|
|
|