Associate Professor (2001-2005), Department of Psychology and Center for Neuroscience, UC Davis
Associate Professor (2005-2010), Helen Wills Neuroscience Institute and School of Optometry, UC Berkeley
Professor (2010–present), Helen Wills Neuroscience Institute and School of Optometry, UC Berkeley
Research
Olshausen's research focuses on understanding the information processing strategies employed by the visual system for tasks such as object recognition and scene analysis. His approach combines studying neural response properties with mathematical modeling to develop functional theories of vision. This work aims to both advance understanding of brain function and develop new algorithms for image analysis based on biological principles. He has also contributed to technological applications, including image and signal processing, alternatives to backpropagation for unsupervised learning, memory storage and computation, analog data compression systems, etc.
Neural coding
One of Olshausen's most significant contributions is demonstrating how the principle of sparse coding can explain response properties of neurons in visual cortex. His 1996 paper in Nature with David J. Field showed how simple cells in the V1 cortexreceptive field properties could emerge from learning a sparse code for natural images.[3] This paper is based on two previous reports that gave additional technical details.[4][5]
Features learned by generalized Hebbian algorithm running on 8-by-8 patches of Caltech 101
The paper argued that simple cells have Gabor-like, localized, oriented, and bandpass receptive fields. Previous methods, such as generalized Hebbian algorithm, obtains Fourier-like receptive fields that are not localized or oriented. But with sparse coding, such receptive fields do emerge.
Specifically, consider an image and some receptive fields . An image can be approximately represented as a linear sum of the receptive fields: . If so, then the image can be coded as , a code which may have better properties than directly coding for the pixel values of the image.
For each image in the batch, solve for the coefficients that minimize the loss function
Define the reconstructed image .
Update each feature by Hebbian learning: . Here, is the learning rate and the expectation is over all images in the batch.
Update each by . Adjust learning rate.
The key part of the algorithm is the loss functionwhere the first term is image reconstruction loss, and the second term is the sparsity loss. Minimizing the first term leads to accurate image reconstruction, and minimizing the second term leads to sparse linear coefficients, that is, a vector with many almost-zero entries. The hyperparameter balances the importance of image reconstruction vs sparsity.
Based on the 1996 paper, he worked out a theory that the Gabor filters appearing in the V1 cortex performs sparse coding with overcomplete basis set, such that it is optimal for images occurring in the natural habitat of humans.[6][7]