My homepage has moved to www.fransoliehoek.net, please update your bookmarks. (you will be redirected in 5 seconds.)

General research interests

My main interests lie in the field of sequential decision making under uncertainty. This field studies methods that allow an intelligent system, an agent, to make decisions over time. I am particularly interested in multiagent systems. Sequential decision making is studied in, and has close ties to many research fields within AI, CS and economics such as: planning, reinforcement learning, optimization, machine learning, regression, classification, graphical models, probabilistic inference and game theory.

Particular topics of study

I have been engaged in research on the following topics:

  • Decentralized POMDPs: theory, optimal and approximate planning methods.
  • Exploiting structure in graphical models of interactions.
  • Reinforcement learning for teams of agents.
  • Smartly coordinating traffic lights.
  • Reasoning about trust ine-commerce settings via POMDPs.
  • Mining social structures: dealing with entity resolution, and exploratory support for historal databases.
  • Team decision making with delayed communication.
  • Poker formalized as POMDPs.

Research Highlighted

A selection of some research contributions, including slides.

Advances in Optimal Solutions for Dec-POMDPs

The optimal solution of Dec-POMDPs is a notoriously hard problem. Yet, the field is making steady progress in solving larger POMDPs.

Incremental clustering and incremental expansion are two techniques that are introduced in our JAIR'13 article that have greatly expanded the range of problems that can be solved optimally. More recently, I found a reformulation of optimal value functions based on sufficient statistics that may enable more compact representations of those value functions and potentially can lead to speed ups.

For more information, see...

Approximate Solution of Factored Dec-POMDPs via Transfer Planning

In recent work we have introduced transfer planning, a method to compute a factored heuristic for factored Dec-POMDPs. By using this in a factored version of forward-sweep policy computation, we can scale to hundreds of agents.

The main idea behind transfer planning is that we approximate the original problem using a set of multiple abstractions, each involving subsets of agents. The value functions of these so-called source problems, can then be transfered to the original problem.

For more information, see...