Most of the papers in this list were found through one of a small number of methods:

I'm not trying to make any claim that this list is complete in any way, or that any of my comments on these papers are correct by any standard, or even that interesting. These are just my notes to myself as I attempt to wrap my head around the subject. Caveat lector, as the kids say.

At some point, all the local links will disappear, and only the pointers to remote locations will remain.


Philosophy of Causal Discovery

Judea Pearl Others
Pearl, "Statistics and Causal Inference: A Review"  [local] Richard Scheines, "An Introduction to Causal Inference"  [local]
Pearl, "Causal Diagrams for Empirical Research"  [local] Peter Spirtes, "The Limits of Causal Inference from Observational Data"   [local]
Judea Pearl and Thomas Verma, "The Logic of Representing Dependencies by Directed Graphs"  [local] Peter Spirtes, "Directed Cyclic Graphical Representations of Feedback Models"  [local]
Clark Glymour, "Statistics and Causal Inference: Comment: Statistics and Metaphysics"  [local]
Makes me want to find my Kripke.

Statistical Constraints from Causal Models

If you assume a particular causal model has generated the observed data from an experiment, then the structure of the causal model often implies that certain "regularities" are present in the data. Most causal inference procedures turn this observation on its head -- having developed a statistical test for a particular regularity, the detection of that regularity in actual data is used to rule out causal models that would not have produced it. (This is a sound procedure given a property usually called "faithfulness," which can be roughly translated as "Nature won't fool you with spurious regularities by coincidence.") Most of the regularities that are induced by causal models are presented as either "conditional independencies" or "tetrad constraints."
A.P. Dawid, "Conditional Independence in Statistical Theory"  [local]
Bollen & Ting, "A Tetrad Test for Causal Indicators"  [local]
Zhang, Spirtes, "A Transformational Characterization of Markov Equivalence for Directed Maximal Ancestral Graphs"  [local]
From Section 2: "Markov equivalent DMAGs [directed maximal ancestral graphs] can be transformed to each other by a sequence of single mark changes that preserve Markov equivalence." The point, I suppose (and this is supported by their Conclusion section) is that you can use this result to prove that particular properties must by uniform across certain Markov equivalence classes of graphs.
Kang & Tian, "Inequality Constraints in Causal Models with Hidden Variables"  [local]
Introduces inequality constraints on the probabilities of "intervential distributions."

Basics

The easiest task (of course, nothing is easy here) is to infer a "causal" network from a dataset where all the variables have been observed.
Lauritzen, "Causal Inference from Graphical Models"  [local]
Robins, Scheines, Spirtes, & Wasserman, "Uniform Consistency in causal inference"  [local]
A fundamental negative result: "We show that the asymptotically consistent procedures [for estimation of causal effects] are 'pointwise consistent,' but 'uniformly consistent' tests do not exist. Thus, no finite sample size can ever be guaranteed to approximate the asymptotic results."
Richardson, "A Discovery Algorithm for Directed Cyclic Graphs"  [local]
Outlines a method for building cyclic graph structures -- "linear non-recursive structural equation models." Doesn't explicitly address hidden variables, but can be used as a drop-in subroutine for the discovery of hidden variable graph structures in (for example) the Silva "Linear Latent Variable Models" paper, below.
Tian & Pearl, "A New Characterization of the Experimental Implications of Causal Bayesian Networks"   [local]
Ramsay, Spirtes, & Zhang, "Adjacency-Faithfulness and Conservative Causal Inference"  [local]

Discovery Methods with Hidden Variables

Of course, in a complicated system we will be faced with incomplete data -- there may be unobserved common causes, which could explain correlations in the observed data. Papers in this section involve algorithmic and statistical methods for dealing with this problem: attempting to discover "hidden" variables, and to infer the causal networks that might hold among those hidden variables.
Spirtes, Meek, & Richardson, "Causal Inference in the Presence of Latent Variables and Selection Bias"  [local]
Tian & Pearl, "On the Testable Implications of Causal Models with Hidden Variables"  [local]
Ali, Richardson, Spirtes, & Zhang, "Towards characterizing Markov equivalence Classes for Directed Acyclic Graphs with Latent Variables"  [local]
Silva, Scheines, Glymour, & Spirtes, "Learning the Structure of Linear Latent Variable Models"  [local]
Partitions the "measurement" or observed variables up into separate blocks, and uses them to identify distinct latent variables; then, builds a network on those latent variables. They contrast their method with Elidan's (below), and briefly discuss alternative systems for building the network on the hidden variables (Richardson, above). Most of the focus of this paper appears to be on the identification of hidden variables via blocks of observations.
Elidan, Lotner, Friedman, & Koller, "Discovering Hidden Variables: A Structure-Based Approach"  [local]
Zhang, "Hierarchical Latent Class Models for Cluster Analysis"  [local]
Attempts to build a hierarchical (i.e. tree-like) structure on the hidden class variables.

Bioinformatics and Gene-Regulatory Networks

Causal network discovery seems to have gotten its start in the social sciences (economics, sociology, psychology and psychometrics) and other disciplines that measure human behavior. However, some of the most recent applications of these techniques have been focused on biological applications. Starting in the late '90s with the advent of microarray technology (and continuing today with cheap sequencing, but more on that in another place at another time), researchers have thought it possible to decode the "regulatory network" or "expression network" of a cell -- the causal network that underlies the varying transcription of genes, and (ultimately) the multifarious and dynamic phenotypes exhibited by cells and organisms.
Spirtes et al. "Constructing Bayesian Network Models of Gene Expression Networks from Microarray Data"  [local]
A classic example of building an "expression network" from microarray-style experiments.
Chu, Glymour, Scheines, & Spirtes, "A statistical problem for inference to regulatory structure from associations of gene expression measurements with microarrays"  [local]
Another important negative result! Accurate reconstruction of genetic regulatory networks is unlikely to be possible unless we explicitly correct for temporal and population averaging effects inherent in microarray experiments.
Wimberly, Heiman, Ramsey, & Glymour "Experiments on the Accuracy of Algorithms for Inferring the Structure of Genetic Regulatory Networks from Microarray Expression Levels"  [local]
Another negative result -- as of 2003, most gene regulatory network reconstructions managed to find accurate connections between gene pairs "at chance." I suppose it's an open question whether "gene pairs" are the right level at which to do this analysis, and it probably bears some investigation to figure out whether things have gotten better since 2003 (which was, let's point out, the year in which the first paper to attempt a complete measurement of the "regulatory network" of an organism was attempted, although we all know how well that turned out). But also, this might be simply interpreted as saying that the 'regulatory networks' which were built before 2003 could be reasonable in terms of their final predictions, as statistical models, but less-worthy of the term "causal models."

Software

Scheines, Spirtes, Glymour, Meek, & Richardson, "The TETRAD Project: Constraint-based Aids to Causal Model Specification"  [local]
Zupan et al. "GenePath: a system for automated construction of genetic networks from mutant data."  [local]
Also connected with some online software.

Blog Posts

Some collected blog-posts about issues surrounding causal inference:

Andrew Gelman:

  1. "Causal inference and regression, or, chapters 9, 10, and 23"
  2. "How to think about instrumental variables when you get confused"
  3. "Where do instrumental variables come from? And my own favorite (unused) example"

Cosma Shalizi:

  1. "Quantuam Causal Influence"