ACM Home Page
Please provide us with feedback. Feedback
Grouped graphical Granger modeling methods for temporal causal modeling
Full text MovMov (13:46),  PdfPdf (561 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Paris, France
SESSION: Research track papers table of contents
Pages 577-586  
Year of Publication: 2009
ISBN:978-1-60558-495-9
Authors
Aurelie C. Lozano  IBM T. J. Watson Research Center, Yorktown Heights, NY, USA
Naoki Abe  IBM T. J. Watson Research Center, Yorktown Heights, NY, USA
Yan Liu  IBM T. J. Watson Research Center, Yorktown Heights, NY, USA
Saharon Rosset  Tel Aviv University, Tel Aviv, Israel
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 27,   Downloads (12 Months): 110,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1557019.1557085
What is a DOI?

ABSTRACT

We develop and evaluate an approach to causal modeling based on time series data, collectively referred to as "grouped graphical Granger modeling methods." Graphical Granger modeling uses graphical modeling techniques on time series data and invokes the notion of "Granger causality" to make assertions on causality among a potentially large number of time series variables through inference on time-lagged effects. The present paper proposes a novel enhancement to the graphical Granger methodology by developing and applying families of regression methods that are sensitive to group information among variables, to leverage the group structure present in the lagged temporal variables according to the time series they belong to. Additionally, we propose a new family of algorithms we call group boosting, as an improved component of grouped graphical Granger modeling over the existing regression methods with grouped variable selection in the literature (e.g group Lasso). The introduction of group boosting methods is primarily motivated by the need to deal with non-linearity in the data. We perform empirical evaluation to confirm the advantage of the grouped graphical Granger methods over the standard (non-grouped) methods, as well as that specific to the methods based on group boosting. This advantage is also demonstrated for the real world application of gene regulatory network discovery from time-course microarray data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Bach, F. (2008). Consistency of the group Lasso and multiple kernel learning, Technical report HAL-00164735, to appear in Journal of Machine Learning Research.
 
3
 
4
Buhlmann, P. and Yu, B. (2003). Boosting with the L2 Loss: Regression and Classification. J. Amer. Statist. Assoc. 98, 324--340.
 
5
Buhlmann, P. (2007). Twin Boosting: improved feature selection and prediction, Preprint
 
6
Buhlmann, P. (2006). Boosting for high-dimensional linear models. Annals of Statistics 34, 559--583
 
7
Dahlhaus, R. and Eichler, M. (2003), Causality and graphical models in time series analysis. In P. Green, N. Hjort and S. Richardson (eds.): Highly structured stochastic systems, University Press, Oxford.
 
8
Eichler, M. (2007), Granger-causality and path diagrams for multivariate time series. Journal of Econometrics 137, 334--353.
 
9
Enders, W. (2003). Applied Econometric Time Series, 2nd Edition, John Wiley&Sons.
 
10
Hastie, T., Tibshirani R., Friedman JH (2001). The Elements of Statistical Learning. Springer.
 
11
Granger, C. (1980). Testing for causlity: A personal viewpoint. Journal of Economic Dynamics and Control 2, 329--352.
 
12
Hurvich, C., Simonoff, J. and Tsai, C.-L. (1998). Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion. J. R. Stat. Soc. Ser. B Stat. Methodol. 60 271--293. MR1616041
 
13
Lozano, A., Abe, N., Liu, Y, Rosset S. (2009). Grouped Graphical Granger Modeling for Gene Expression Regulatory Networks Discovery. To appear in Proc. ISMB 2009.
 
14
Meinshausen, N. and Bühlmann, P. (2006). High-dimensional graphs and variable selection with the Lasso. Annals of Statistics 34, 1436--1462
 
15
 
16
Sambo F., Di Camillo B, Toffolo G. (2008) CNET: an algorithm for Reverse Engineering of Causal Gene Networks. Bioinformatics Methods for Biomedical Complex Systems Applications. 8th Workshop on Network Tools and Applications in Biology NETTAB2008. Varenna, Italy.
 
17
Spirtes P., Glymour C., and Scheines, R. Causation, Prediction, and Search. MIT Press, New York, NY, second edition, 2000.
 
18
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. Royal. Statist. Soc B., Vol. 58, No. 1, pages 267--288.
 
19
Yuan, M. and Lin, Y. (2006) Model selection and estimation in regression with grouped variables. J. R. Statist. Soc. B, 68, 49--67, pp. 49--67.
 
20
Zou, H. (2006) The Adaptive Lasso and its Oracle Properties. Journal of the American Statistical Society, Vol. 101, No. 476, pp. 1418--1429.
 
21
Zhao, P, Rocha, G. and Yu, B. (2006). Grouped and hierarchical model selection through composite absolute penalties. Annals of Statistics (to appear).
 
22
Zhou, N. and Zhu, J. (2007) Group variable selection via a hierarchical lasso and its oracle property. Unpublished manuscript.


Collaborative Colleagues:
Aurelie C. Lozano: colleagues
Naoki Abe: colleagues
Yan Liu: colleagues
Saharon Rosset: colleagues