Publication | Open Access
Variational Autoencoder: An Unsupervised Model for Modeling and Decoding fMRI Activity in Visual Cortex
24
Citations
61
References
2017
Year
Unknown Venue
Convolutional Neural NetworkNeural Networks (Machine Learning)AutoencodersBrain MappingVisual Cognitive NeuroscienceSocial SciencesNatural ImagesUnsupervised ModelVisual CognitionSensory NeuroscienceNetwork NeuroscienceCognitive ScienceNeuroimagingNeural Networks (Computational Neuroscience)Visual ProcessingDeep LearningFmri ActivityPredictive CodingNatural VideosComputational NeuroscienceVariational AutoencoderHuman NeuroscienceNeuroscienceDecoding Fmri ActivityBrain Modeling
Abstract Goal-driven and feedforward-only convolutional neural networks (CNN) have been shown to be able to predict and decode cortical responses to natural images or videos. Here, we explored an alternative deep neural network, variational auto-encoder (VAE), as a computational model of the visual cortex. We trained a VAE with a five-layer encoder and a five-layer decoder to learn visual representations from a diverse set of unlabeled images. Inspired by the “free-energy” principle in neuroscience, we modeled the brain’s bottom-up and top-down pathways using the VAE’s encoder and decoder, respectively. Following such conceptual relationships, we used VAE to predict or decode cortical activity observed with functional magnetic resonance imaging (fMRI) from three human subjects passively watching natural videos. Compared to CNN, VAE resulted in relatively lower accuracies for predicting the fMRI responses to the video stimuli, especially for higher-order ventral visual areas. However, VAE offered a more convenient strategy for decoding the fMRI activity to reconstruct the video input, by first converting the fMRI activity to the VAE’s latent variables, and then converting the latent variables to the reconstructed video frames through the VAE’s decoder. This strategy was more advantageous than alternative decoding methods, e.g. partial least square regression, by reconstructing both the spatial structure and color of the visual input. Findings from this study support the notion that the brain, at least in part, bears a generative model of the visual world.
| Year | Citations | |
|---|---|---|
Page 1
Page 1