I went through and reviewed this year's MAIN conference, NeurIPS, and CCN, as well as whatever papers and preprints happened to show up in my Twitter feed. Of course, this review reflects my research interests (heavily skewed towards a certain flavor of neuro-AI and vision), but I hope it's useful to many of you who want to see where the field is going.

Multimodal networks learn from several modalities (vision, text, audio, etc.) by predicting one modality from another, or by predicting a common subspace. CLIP is perhaps the most famous multimodal network – it's trained contrastively. All of these methods allow us to learn a representation without the need for pesky supervision. If it turns out that this representation is aligned with a brain area, that's a win, since self-supervised and unsupervised methods are more biologically plausible than supervised ones.

## Unsupervised neural network models of the ventral visual stream

Primates show a remarkable ability to recognize objects. This ability is achieved by their ventral visual stream, multiple hierarchically interconnected brain areas. The best quantitative models of these areas are deep neural networks trained with human annotations. However, those networks receive far more annotations than infants do, making them implausible models of ventral stream development.

*Figure from Zhuang et al.*

Zhuang et al. found that unsupervised and self-supervised methods learn representations that are well aligned with ventral stream (V1, V4, IT) neurons. The paper was published in PNAS this year and already has more than 60 citations.
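To make the contrastive training mentioned above concrete, here is a minimal numpy sketch of a symmetric InfoNCE-style loss over a batch of paired image/text embeddings, the kind of objective CLIP-like models optimize. The function name, temperature value, and shapes are illustrative assumptions, not the actual CLIP implementation.

```python
import numpy as np

def clip_contrastive_loss(img_emb, txt_emb, temperature=0.07):
    """Symmetric InfoNCE loss: matched image/text pairs share a row index."""
    # L2-normalize so dot products are cosine similarities
    img = img_emb / np.linalg.norm(img_emb, axis=1, keepdims=True)
    txt = txt_emb / np.linalg.norm(txt_emb, axis=1, keepdims=True)
    logits = img @ txt.T / temperature        # (B, B) similarity matrix
    labels = np.arange(len(logits))           # matching pairs lie on the diagonal

    def xent(l):
        l = l - l.max(axis=1, keepdims=True)  # numerical stability
        logp = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -logp[labels, labels].mean()   # -log prob of the true pair

    # cross-entropy in both directions: image -> text and text -> image
    return (xent(logits) + xent(logits.T)) / 2
```

With matched pairs on the diagonal, the loss pulls each image embedding toward its own caption and pushes it away from the other captions in the batch; no human labels are needed, only the pairing itself.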