251-0566-00L Summer semester 2006
This class is a weekly reading group discussing research papers on causality inference from observational or experimental data. In a purely observational setting, quantities of interest (variables) can be recorded, but not acted upon. In an experimental setting, some controllable variables can be acted upon. The selected papers aim at understanding machine learning techniques to infer causality, including causal graphs derived from "graphical models”.
Tuesday 17:00-18:00PM, CAB H 57 (plan)
Pattern Analysis and Machine Learning Group
Office: CAB G 69.2 Universitätstrasse 6
Phone: (044) 63 23124
E m a il: firstname.lastname@example.org
These are recommended but will not be strictly enforced.
Each week, the student will have to read a paper or a book chapter, which will be discussed in class. The students are encouraged to familiarize themselves with Bayesian networks using the Genie software. The software works directly on Windows platforms, but may also be run through Wine on Unix.
This is tested on Ubuntu Linux 5.10:
* Install wine (http://www.winehq.com/). Most distributions should
have their own way to install this package, e.g. "apt-get
install wine" on debian based distributions.
* Download setup file of genie and run: "wine genie2_setup.exe",
this should install all the needed files to
~/.wine/drive_c/Program Files/GeNIe 2.0
* change to this directory and run: "wine genie.exe"
The class is worth 4 credit points
Each student will have to take the lead of the discussion on one paper. Students should sign up for a date. This will be done on a first-come first-serve basis. Please email the instructor email@example.com. This assignment will require a little more preparation that particular week, including writing a summary of the paper and a log of the discussion. This is NOT a presentation: slides may be used as partial support, but are not required.
There will be no exam. Attendance to at least 2/3 of the reading groups is required. There will be a sign-up sheet every time to check presence and completion of the homework assignment (usually reading a paper or book chapter). You earn one point for presence and one point for homework completion. You must earn at least 18 points to get your credit points for the class.
The readings are organized around 4 themes revolving around book chapters from Judea Pearl's book "Causality". They are complemented by algorithm/theory papers and book chapters of other authors presenting a complementary viewpoints and application papers. This tentative list of papers is subjet to change, please check every week the assignement for the following week. Copies of the paper to read the next week will be provided. Send requests to firstname.lastname@example.org.
||Paper||Discussion leader||Summary, discussion, and/or slides|
||Graphical models and causality
reading group presentation (tutorial)
||I. Basic concepts
models without tears, Eugene Charniak
||Install Genie. Build
an example network with the software implementing the example
of Fig. 2 of the paper.
||Probabilistic Reasoning. Chapter 14 of the book "Artificial Intelligence:
A modern approach" by Stuart Russell and Peter Norvig.
||Introduction to probabilities,
graphs, and causal models. Chapter 1 of the book "Causality" by Judea Pearl.
chapter is available in pdf.
||II. Basic methods
||Belief propagation. An
introduction to factor graphs, by Hans-Andrea Loeliger.
||Structure learning. A tutorial on
learning with Bayesian networks, by David Heckerman.
[ppt]. Slides [pdf].
||Variational methods. An
Introduction to Variational Methods for Graphical Models,
by Michael Jordan, Zoubin Ghahramani, Tommi Jaakkola, and
||Summary. Slides . Slide handouts.
||A theory of inferred causation.
Chapter 2 of the book "Causality" by Judea
(see also availability from Pearl's site)
||III. Identification of causal
||Statistical causality analysis
of INFOSEC alert data
Xinzhou Qin and Wenke Lee
||Causal diagrams and the identification
of causal effects. Chapter 3 of the book "Causality" by Judea Pearl.
(see also availability from Pearl's site)
Bayesian networks to analyze expression data Nir Friedman, Michal Linial,
Iftach Nachman, and Dana Pe'er
||IV. Control, action, planning
||The art of science
of cause and effect. Epilogue of the book "Causality" by Judea Pearl.
||N-1 experiments suffice to
determine the causal relations among N variables, Frederick Eberhardt, Clark
Glymour, Richard Scheines
structural models and causal inference in epidemiology,
by James Robins, Miguel Angel Hernan, and Babette
1) Bayesian networks and Decision graphs - FB Jensen.
2) Bayesian artificial Intelligence - Kevin B. Korb and Anne K. Nicholson
3) An introduction to Graphical models - Kevin P. Murphy (http://www.cs.ubc.ca/~murphyk/Papers/intro_gm.pdf)
4) Bayesian networks and beyond - Unpublished book Daphne Koller and Nir Friedman
1) Hugin (http://www.hugin.com) is an excellent tool to construct and test Bayesian networks.
2) WinBUGS (http://www.mrc-bsu.cam.ac.uk/bugs/winbugs/contents.shtml) is a statistical tool to estimate the Bayesian inference with MCMC simulations using Gibbs Sampling.
Ebooks from the ETH library:
*Applied Bayesian modelling. Peter Congdon - Wiley (2003) (http://www3.interscience.wiley.com/cgi-bin/booktoc/104531773?CRETRY=1&SRETRY=0 )
*Bayesian approach to image interpretation. Sunil K. Kopparapu, Uday B. Desai - Kluwer Academic Publishers (2001) (http://ebooks.springerlink.com/summary.asp?id=70076 )
*Bayesian approaches to clinical trials and health-care evaluation. David J. Spiegelhalter, Keith R. Abrams, Jonathan P. Myles - Wiley (2004) (http://www3.interscience.wiley.com/cgi-bin/booktoc/107614005 )
*Bayesian artificial intelligence. Kevin B. Korb, Ann E. Nicholson - Chapman & Hall/CRC (2004) (http://www.statsnetbase.com/books/1219/c3871_fm.pdf )
*Bayesian economics through numerical methods: a guide to econometrics and decision-making with prior information. Jeffrey H. Dorfman - Springer (1997) http://ebooks.springerlink.com/summary.asp?id=99654 )
*Bayesian forecasting and dynamic models. Mike West, Jeff Harrison - Springer (cop. 1997) (http://ebooks.springerlink.com/summary.asp?id=104512 )
*Bayesian nonparametrics. J.K. Ghosh, R.V. Ramamoorthi - Springer (2003) (http://ebooks.springerlink.com/summary.asp?id=98937 )
*Introduction to Bayesian statistics. William M. Bolstad - Wiley (2004) (http://www3.interscience.wiley.com/cgi-bin/bookhome/109855377 )
*Likelihood, Bayesian, and MCMC methods in quantitative genetics. Daniel Sorensen, Daniel Gianola - Springer (2002) (http://ebooks.springerlink.com/summary.asp?id=98912 )
*Measurement error and misclassification in statistics and epidemiology: impacts and bayesian adjustments. Paul Gustafson - CRC Press Company (2004) (http://www.statsnetbase.com/books/1196/c3359_fm.pdf )
*Multivariate Bayesian statistics: models for source separation and signal unmixing. Daniel B. Rowe - Chapman & Hall/CRC (2003) (http://www.statsnetbase.com/books/980/c3189_fm.pdf )