ACM Home Page
Please provide us with feedback. Feedback
Online collaborative filtering with nearly optimal dynamic regret
Full text PdfPdf (172 KB)
Source ACM Symposium on Parallel Algorithms and Architectures archive
Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures table of contents
San Diego, California, USA
SESSION: Online algorithms and games table of contents
Pages: 315 - 319  
Year of Publication: 2007
ISBN:978-1-59593-667-7
Authors
Baruch Awerbuch  Johns Hopkins University
Thomas P. Hayes  Toyota Technological Institute, Chicago, IL
Sponsors
ACM: Association for Computing Machinery
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 39,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1248377.1248431
What is a DOI?

ABSTRACT

We consider a model for sequential online decision-making by many diverse agents. On each day, each agent makes a decision, and pays a penalty if it is a mistake. Obviously, it would be good for agents to avoid repeating the same mistakes made by other agents; however, difficulty may arise when some agents disagree over what constitutes a mistake, perhaps maliciously.

As a metric of success for this problem, we consider dynamic regret, i.e., regret versus the off-line optimal sequence of decisions. Previous regret bounds usually use the much weaker notion of static regret, i.e., regret versus the best single decision in hindsight. We assume there is a set of "honest" players whose valuations for the decisions at each time step are identical. No assumptions are made about the remaining players, and the algorithm assumes no information about which are the honest players.

We present an algorithm for this setting whose expected dynamic regret per honest player is optimal up to a multiplicative constant and an additive polylogarithmic term, assuming the number of options is bounded.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
B. Awerbuch and C. Scheideler. Enforced Spreading: An improved protocol for provably secure distributed name service. Technical report, Johns Hopkins University, February 2004.
 
3
B. Awerbuch and C. Scheideler. Group Spreading: A protocol for provably secure distributed name service. In proc 31st ICALP, 2004.
4
 
5
Baruch Awerbuch, David Holmer, Herbert Rubens, and Robert Kleinberg. Provably competitive adaptive routing. In Infocom. IEEE, 2005.
 
6
Baruch Awerbuch and Robert Kleinberg. Collaborative learning. In Proceedings ACM COLT (Computational Learning Theory), 2005.
7
 
8
 
9
 
10
Baruch Awerbuch and Christian Scheideler. Robust distributed name service. In Proc. of 3rd International Workshop on Peer-to-Peer Systems (IPTPS), feb 2004.
11
 
12
Baruch Awerbuch and Christian Scheideler. Robust random number generation for peer-to-peer systems. In In Proc. ACM OPODIS, 2007.
 
13
Baruch Awerbuch and Christian Scheideler. Towards scalable and robust overlay networks. In Proceedings of IPTPS, 2007.
 
14
J. M. Barzdin and R. V. Freivalds. On the prediction of general recursive functions. Soviet Mathematics Doklady, 13:1224--1228, 1972.
 
15


Collaborative Colleagues:
Baruch Awerbuch: colleagues
Thomas P. Hayes: colleagues