Causal models, learning, & video games

University of Minnesota, Spring Semester, 2009

Recent evidence suggests that theories of Bayesian agents may provide good first-order models of how humans learn to make optimal decisions in complex dynamic tasks, such as those that require learning the underlying causal structure of the world and the effects of our actions on it. Optimal in this context is defined in terms of attaining the task goal while minimizing losses and maximizing gains. It has also recently been shown that experience with certain types of video games can produce generalized transfer of learning. Although this theoretical and empirical research is promising, it is important to test its predictions under conditions of increasing realism if we are to understand how optimal learning of complex dynamic tasks can be induced. Video game technology provides an unprecedented opportunity to experimentally manipulate and control the task factors involved in skill acquisition, including the underlying causal models, task constraints, and reward models. In this course, we will discuss literature on theories of perception, cognition, and action from the point of view of Bayesian agents, together with recent behavioral research on learning and skill acquisition.

Format: Discussion of journal articles led by seminar members; term paper or term project on a related topic.


Jan 20 Introduction Dan Kersten (pdf), Paul Schrater (pdf) & Shawn Green
Jan 27 Ahissar, M., Nahum, M., Nelken, I., & Hochstein, S. (2008). Reverse hierarchies and sensory learning. Philos Trans R Soc Lond B Biol Sci. (pdf) Shawn Green (pdf)
Feb 3 Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285. (pdf) Paul Schrater (pdf)
Green, C. S., & Bavelier, D. (2008). Exercising your brain: A review of human brain plasticity and training-induced learning. Psychol Aging, 23(4), 692-701. (pdf)

Xiao, L. Q., Zhang, J. Y., Wang, R., Klein, S. A., Levi, D. M., & Yu, C. (2008). Complete transfer of perceptual learning across retinal locations enabled by double training. Curr Biol, 18(24), 1922-1926. (pdf)

Dosher, B. A., & Lu, Z. L. (2007). The functional form of performance improvements in perceptual learning: learning rates and transfer. Psychol Sci, 18(6), 531-539. (pdf)




Kilgard, M. P., & Merzenich, M. M. (1998). Cortical map reorganization enabled by nucleus basalis activity. Science, 279(5357), 1714-1718. (pdf)

Bao, S., Chan, V. T., & Merzenich, M. M. (2001). Cortical remodelling induced by activity of ventral tegmental dopamine neurons. Nature, 412(6842), 79-83. (pdf)

Koepp, M. J., Gunn, R. N., Lawrence, A. D., Cunningham, V. J., Dagher, A., Jones, T., et al. (1998). Evidence for striatal dopamine release during a video game. Nature, 393(6682), 266-268. (pdf)


Mar 3 Kemp, C. and Tenenbaum, J. B. (2008). The discovery of structural form. Proceedings of the National Academy of Sciences. 105(31), 10687-10692. (pdf) (suppl pdf)  
Steyvers et al. (2003) (pdf)

(Supplementary links: Pearl's site, including a review.)

Mar 24 Dearden et al. 1998 (pdf), Niv et al. 2006 (pdf)  
Poupart et al. 2007 (pdf) (ICML 07 video)

Strens 2000 (pdf)

Apr 7 Dayan & Daw 2008 (pdf) Cohen, McClure & Yu 2007 (pdf)  
Apr 14 Kording et al. 2008 (pdf) Sloman et al. 2006 (pdf)  
Apr 21 Boutilier et al. 1995 (pdf)  
Cutumisu et al. 2008 (pdf)


May 5 Discussion of Final Project goals -- Guidelines  
May 16 Final Project Due -- See Guidelines  

Final Project Results -- Selected Bibliographies: CD, JS, MAA, ME, SJ


Final Assignment

What Should Transfer? How the credit assignment problem is solved should affect
what is transferable and generalizable:

1. Learned policy transfer

2. Perceptual model transfer

3. World model transfer

4. Reward model transfer

5. World metamodel / Reward metamodel transfer