Object Recognition
object recognition
Jorge Almeida (Proaction Lab)
The Proaction Laboratory (Jorge Almeida’s Lab; proactionlab.fpce.uc.pt) at the University of Coimbra (www.uc.pt), Portugal is looking for 3 motivated and bright Research Assistants to work on a prestigious ERC Starting Grant project (ContentMAP; https://cordis.europa.eu/project/id/802553) on the neural organization of object knowledge. In this project we are exploring how complex information is topographically organized in the brain using fMRI and state of the art analytical techniques, as well as computational approaches, and neuromodulation. We strongly and particularly encourage applications from women, and from underrepresented groups in academia. General Requirements for the positions: 1. Candidates should have a BA and/or MA in Psychology, Cognitive Neuroscience, Computer Science, Computational Neuroscience or any other related field as long as their work relates to the specific profiles below. 2. They should already have their diplomas (so that we can start the process of recognition in Portugal, which is a necessary step for hiring). 3. Interest in object recognition and neural representation. 5. Very good English (oral and written) communicative skills are necessary. Specific requirements for the positions: 1. Understanding of and experience with fMRI and data analysis, and specifically with MVPA. 2. Strong programming skills (matlab, python, etc.) are a requirement. Salary and duration: The position will start as soon as possible and finish in January 2024. The salary is the standard for a PhD student in Portugal – about 1100 per month tax free. Note that cost of living in Portugal (and particularly in Coimbra) is low compared to major European and American cities. Working conditions: The researcher will work directly with Jorge Almeida in Coimbra. The researcher will also be encouraged to develop her/his own projects and look for additional funding so that the stay can be extended. In fact, the expectation is that the applicants start a PhD one year after starting their positions. We have access to 2 3T MRI scanner with a 32-channel coil, to tDCS with neuronavigation, and to a fully set psychophysics lab. We have EEG and eyetracking on site. We also have access, through other collaborations, to a 7T scanner. Finally, the University of Coimbra is a 700 year old University and has been selected as a UNESCO world Heritage site. Coimbra is one of the most lively university cities in the world, and it is a beautiful city with easy access to the beach and mountain. How can I apply: Applicants are encouraged to apply as soon as possible as these positions will be closed as they are filled. Nevertheless, the deadline in May 15. The interested candidates should email Jorge Almeida for questions and applications. Please send an email (jorgealmeida@fpce.uc.pt) with the subject “Research assistant positions under ERC - ContentMAP” with: 1. The Curriculum Vitae with a list of publications, 2. 2 Reference letters 3. A motivation letter with a short description of your experience in the field and how you fulfill the requirements (fit with the position).
Jorge Almeida
The Proaction Laboratory (proactionlab.fpce.uc.pt) at the University Coimbra Portugal is looking for Researchers at initial stages (post-PhD) of their career to be part of the lab in a joint competitive application to a Portuguese Science Foundation (FCT) independent researcher call. We particularly encourage applications from women, and from underrepresented groups in academia. The applicant and the lab will work on a competitive project to be submitted. Results from the application are expected to be out mid 2025. The application will be open September 30, and will close November 29, 2024. The positions are as independent researchers in the Proaction Lab, are for 3 years, and the salary is the same as the Portuguese payroll for University Professors (net values for junior or assistant positions, for instance are approximately 1700 or 2100 euros net-value per month in a 14-month salary per year; these are competitive salaries for the cost of living in Portugal and especially in Coimbra). The Proaction Lab is currently very well funded as we have a set of on-going funded projects including a Starting Grant ERC to Jorge Almeida, a major European ERA Chair project to Jorge ALmeida and Alfonso Caramzza, and other projects. We have access to a 3T MRI scanner with a 32-channel coil, to tDCS, and to a fully set psychophysics lab. We have a 256 ch EEG, motion tracking and eyetracking on site. We also have a science communication office dedicated to the lab. Finally, the University of Coimbra is a 700 year old University and has been selected as a UNESCO world Heritage site. Coimbra is one of the most lively university cities in the world, and it is a beautiful city with easy access to the beach and mountain. You should apply as soon as you can - the sooner the better so that we can prepare the application. If interested send an email to jorgecbalmeida@gmail.com, with a CV, and motivation/scientific proposal letter. If there is a fit, we will jointly apply to these positions – we have had in past applications a high success rate as a lab (in four previous editions, we got several applications that were offered a position).
Jorge Almeida
The Proaction Laboratory (proactionlab.fpce.uc.pt) at the University Coimbra Portugal is looking for Researchers at initial stages (post-PhD) of their career to be part of the lab in a joint competitive application to a Portuguese Science Foundation (FCT) independent researcher call. We particularly encourage applications from women, and from underrepresented groups in academia. The applicants should have obtained a PhD, and have an interest in cognitive neuroscience, vision science and preferably (but not limited to) object recognition, shape processing, and texture and surface processing. We are particularly interested in motivated and independent Researchers addressing these topics with strong expertise in fMRI (in particular decoding and multivariate approaches). Good programming skills, great communication and mentoring skills, and a great command of English are a plus. The applicant and the lab will work on a competitive project to be submitted. Results from the application are expected to be out mid 2025. The application will be open September 30, and will close November 29, 2024. The positions are as independent researchers in the Proaction Lab, are for 3 years, and the salary is the same as the Portuguese payroll for University Professors (net values for junior or assistant positions, for instance are approximately 1700 or 2100 euros net-value per month in a 14-month salary per year; these are competitive salaries for the cost of living in Portugal and especially in Coimbra). The Proaction Lab is currently very well funded as we have a set of on-going funded projects including a Starting Grant ERC to Jorge Almeida, a major European ERA Chair project to Jorge Almeida and Alfonso Caramazza, and other projects. We have access to a 3T MRI scanner with a 32-channel coil, to tDCS, and to a fully set psychophysics lab. We have a 256 ch EEG, motion tracking and eye-tracking on site. We also have a science communication office dedicated to the lab. Finally, the University of Coimbra is a 700-year old University and has been selected as a UNESCO world Heritage site. Coimbra is one of the liveliest university cities in the world, and it is a beautiful city with easy access to the beach and mountain. You should apply as soon as you can - the sooner the better so that we can prepare the application. If interested send an email to jorgecbalmeida@gmail.com, with a CV, and motivation/scientific proposal letter. If there is a fit, we will jointly apply to these positions – we have had in past applications a high success rate as a lab (in four previous editions, we got several applications that were offered a position).
Jorge Almeida
The Proaction Laboratory at the University Coimbra Portugal is looking for Researchers at initial stages (post-PhD) of their career to be part of the lab in a joint competitive application to a Portuguese Science Foundation (FCT) researcher call. The positions are as independent researchers in the Proaction Lab, are for 3 years, and the salary is the same as the Portuguese payroll for University Professors (net values for junior or assistant positions, for instance are approximately 1700 or 2100 euros net-value per month in a 14-month salary per year; these are competitive salaries for the cost of living in Portugal and especially in Coimbra). The Proaction Lab is currently very well funded as we have a set of on-going funded projects including a Starting Grant ERC to Jorge Almeida, a major European ERA Chair project to Jorge Almeida and Alfonso Caramzza, and other projects. We have access to a 3T MRI scanner with a 32-channel coil, to tDCS, and to a fully set psychophysics lab. We have a 256 ch EEG, motion tracking and eyetracking on site. We also have a science communication office dedicated to the lab. Finally, the University of Coimbra is a 700 year old University and has been selected as a UNESCO world Heritage site. Coimbra is one of the most lively university cities in the world, and it is a beautiful city with easy access to the beach and mountain.
Jorge Almeida
The Proaction Laboratory at the University Coimbra Portugal is looking for Researchers at initial stages (post-PhD) of their career to be part of the lab in a joint competitive application to a Portuguese Science Foundation (FCT) independent researcher call. The positions are as independent researchers in the Proaction Lab, are for 3 years, and the salary is the same as the Portuguese payroll for University Professors (net values for junior or assistant positions, for instance are approximately 1700 or 2100 euros net-value per month in a 14-month salary per year; these are competitive salaries for the cost of living in Portugal and especially in Coimbra). The Proaction Lab is currently very well funded as we have a set of on-going funded projects including a Starting Grant ERC to Jorge Almeida, a major European ERA Chair project to Jorge Almeida and Alfonso Caramzza, and other projects. We have access to a 3T MRI scanner with a 32-channel coil, to tDCS, and to a fully set psychophysics lab. We have a 256 ch EEG, motion tracking and eyetracking on site. We also have a science communication office dedicated to the lab. Finally, the University of Coimbra is a 700 year old University and has been selected as a UNESCO world Heritage site. Coimbra is one of the most lively university cities in the world, and it is a beautiful city with easy access to the beach and mountain.
Jorge Almeida
I am looking for a Post-Doctoral Researcher at the initial stages (post-PhD and no more than 2 years and half after obtaining their PhD). The applicants should have obtained a PhD, and have an overall interest in object recognition, potentially focusing on object-related features like shape, texture material and surface properties, and/or object-related action. I am particularly interested in researchers with strong expertise in fMRI, and in particular decoding and multivariate approaches. Good programming skills, great communication and mentoring skills, and a great command of English are a plus. The selected applicant will work with me (Jorge Almeida) but will also benefit from the lively academic environment and research groups we are currently building in the Psychology Department of the University of Coimbra, Portugal. The projects will relate to my work on object and mid-level processing. The position is for 2 to 3 years, and the salary is the standard Post-Doctoral pay-scale in Portugal (net value 1800 euros per month; this is a competitive salary for the cost of living in Portugal and especially in Coimbra). Start time should be as soon as possible. The Proaction Lab is currently well-funded as we have a set of on-going funded projects including a Starting Grant ERC to Jorge Almeida, a major European ERA Chair project to Jorge Almeida and Alfonso Caramazza, and other projects. We have access to a 3T MRI scanner with a 32-channel coil, to a 7T scanner (in collaboration with a site outside of Portugal), to tDCS, and to a fully set psychophysics lab. We have a 256 ch EEG, motion tracking and eyetracking on site. We also have a science communication office dedicated to the lab. Finally, the University of Coimbra is a 700-year-old University and has been selected as a UNESCO world Heritage site. Coimbra is one of the liveliest university cities in the world, and it is a beautiful city with easy access to the beach and mountain.
Single-neuron correlates of perception and memory in the human medial temporal lobe
The human medial temporal lobe contains neurons that respond selectively to the semantic contents of a presented stimulus. These "concept cells" may respond to very different pictures of a given person and even to their written or spoken name. Their response latency is far longer than necessary for object recognition, they follow subjective, conscious perception, and they are found in brain regions that are crucial for declarative memory formation. It has thus been hypothesized that they may represent the semantic "building blocks" of episodic memories. In this talk I will present data from single unit recordings in the hippocampus, entorhinal cortex, parahippocampal cortex, and amygdala during paradigms involving object recognition and conscious perception as well as encoding of episodic memories in order to characterize the role of concept cells in these cognitive functions.
Contentopic mapping and object dimensionality - a novel understanding on the organization of object knowledge
Our ability to recognize an object amongst many others is one of the most important features of the human mind. However, object recognition requires tremendous computational effort, as we need to solve a complex and recursive environment with ease and proficiency. This challenging feat is dependent on the implementation of an effective organization of knowledge in the brain. Here I put forth a novel understanding of how object knowledge is organized in the brain, by proposing that the organization of object knowledge follows key object-related dimensions, analogously to how sensory information is organized in the brain. Moreover, I will also put forth that this knowledge is topographically laid out in the cortical surface according to these object-related dimensions that code for different types of representational content – I call this contentopic mapping. I will show a combination of fMRI and behavioral data to support these hypotheses and present a principled way to explore the multidimensionality of object processing.
Analyzing artificial neural networks to understand the brain
In the first part of this talk I will present work showing that recurrent neural networks can replicate broad behavioral patterns associated with dynamic visual object recognition in humans. An analysis of these networks shows that different types of recurrence use different strategies to solve the object recognition problem. The similarities between artificial neural networks and the brain presents another opportunity, beyond using them just as models of biological processing. In the second part of this talk, I will discuss—and solicit feedback on—a proposed research plan for testing a wide range of analysis tools frequently applied to neural data on artificial neural networks. I will present the motivation for this approach as well as the form the results could take and how this would benefit neuroscience.
Neuroscience of socioeconomic status and poverty: Is it actionable?
SES neuroscience, using imaging and other methods, has revealed generalizations of interest for population neuroscience and the study of individual differences. But beyond its scientific interest, SES is a topic of societal importance. Does neuroscience offer any useful insights for promoting socioeconomic justice and reducing the harms of poverty? In this talk I will use research from my own lab and others’ to argue that SES neuroscience has the potential to contribute to policy in this area, although its application is premature at present. I will also attempt to forecast the ways in which practical solutions to the problems of poverty may emerge from SES neuroscience. Bio: Martha Farah has conducted groundbreaking research on face and object recognition, visual attention, mental imagery, and semantic memory and - in more recent times - has been at the forefront of interdisciplinary research into neuroscience and society. This deals with topics such as using fMRI for lie detection, ethics of cognitive enhancement, and effects of social deprivation on brain development.
Object recognition by touch and other senses
What does the primary visual cortex tell us about object recognition?
Object recognition relies on the complex visual representations in cortical areas at the top of the ventral stream hierarchy. While these are thought to be derived from low-level stages of visual processing, this has not been shown, yet. Here, I describe the results of two projects exploring the contributions of primary visual cortex (V1) processing to object recognition using artificial neural networks (ANNs). First, we developed hundreds of ANN-based V1 models and evaluated how their single neurons approximate those in the macaque V1. We found that, for some models, single neurons in intermediate layers are similar to their biological counterparts, and that the distributions of their response properties approximately match those in V1. Furthermore, we observed that models that better matched macaque V1 were also more aligned with human behavior, suggesting that object recognition is derived from low-level. Motivated by these results, we then studied how an ANN’s robustness to image perturbations relates to its ability to predict V1 responses. Despite their high performance in object recognition tasks, ANNs can be fooled by imperceptibly small, explicitly crafted perturbations. We observed that ANNs that better predicted V1 neuronal activity were also more robust to adversarial attacks. Inspired by this, we developed VOneNets, a new class of hybrid ANN vision models. Each VOneNet contains a fixed neural network front-end that simulates primate V1 followed by a neural network back-end adapted from current computer vision models. After training, VOneNets were substantially more robust, outperforming state-of-the-art methods on a set of perturbations. While current neural network architectures are arguably brain-inspired, these results demonstrate that more precisely mimicking just one stage of the primate visual system leads to new gains in computer vision applications and results in better models of the primate ventral stream and object recognition behavior.
If we can make computers play chess, why can't we make them see?
If we can make computers play chess and even Jeopardy and Go, then why can't we make them see like us? How does our brain solve the problem of seeing? I will describe some of our recent insights into understanding object recognition in the brain using behavioral, neuronal and computational methods.
NMC4 Short Talk: Directly interfacing brain and deep networks exposes non-hierarchical visual processing
A recent approach to understanding the mammalian visual system is to show correspondence between the sequential stages of processing in the ventral stream with layers in a deep convolutional neural network (DCNN), providing evidence that visual information is processed hierarchically, with successive stages containing ever higher-level information. However, correspondence is usually defined as shared variance between brain region and model layer. We propose that task-relevant variance is a stricter test: If a DCNN layer corresponds to a brain region, then substituting the model’s activity with brain activity should successfully drive the model’s object recognition decision. Using this approach on three datasets (human fMRI and macaque neuron firing rates) we found that in contrast to the hierarchical view, all ventral stream regions corresponded best to later model layers. That is, all regions contain high-level information about object category. We hypothesised that this is due to recurrent connections propagating high-level visual information from later regions back to early regions, in contrast to the exclusively feed-forward connectivity of DCNNs. Using task-relevant correspondence with a late DCNN layer akin to a tracer, we used Granger causal modelling to show late-DCNN correspondence in IT drives correspondence in V4. Our analysis suggests, effectively, that no ventral stream region can be appropriately characterised as ‘early’ beyond 70ms after stimulus presentation, challenging hierarchical models. More broadly, we ask what it means for a model component and brain region to correspond: beyond quantifying shared variance, we must consider the functional role in the computation. We also demonstrate that using a DCNN to decode high-level conceptual information from ventral stream produces a general mapping from brain to model activation space, which generalises to novel classes held-out from training data. This suggests future possibilities for brain-machine interface with high-level conceptual information, beyond current designs that interface with the sensorimotor periphery.
Aesthetic preference for art can be predicted from a mixture of low- and high-level visual features
It is an open question whether preferences for visual art can be lawfully predicted from the basic constituent elements of a visual image. Here, we developed and tested a computational framework to investigate how aesthetic values are formed. We show that it is possible to explain human preferences for a visual art piece based on a mixture of low- and high-level features of the image. Subjective value ratings could be predicted not only within but also across individuals, using a regression model with a common set of interpretable features. We also show that the features predicting aesthetic preference can emerge hierarchically within a deep convolutional neural network trained only for object recognition. Our findings suggest that human preferences for art can be explained at least in part as a systematic integration over the underlying visual features of an image.
Seeing with technology: Exchanging the senses with sensory substitution and augmentation
What is perception? Our sensory modalities transmit information about the external world into electrochemical signals that somehow give rise to our conscious experience of our environment. Normally there is too much information to be processed in any given moment, and the mechanisms of attention focus the limited resources of the mind to some information at the expense of others. My research has advanced from first examining visual perception and attention to now examine how multisensory processing contributes to perception and cognition. There are fundamental constraints on how much information can be processed by the different senses on their own and in combination. Here I will explore information processing from the perspective of sensory substitution and augmentation, and how "seeing" with the ears and tongue can advance fundamental and translational research.
Towards a neurally mechanistic understanding of visual cognition
I am interested in developing a neurally mechanistic understanding of how primate brains represent the world through its visual system and how such representations enable a remarkable set of intelligent behaviors. In this talk, I will primarily highlight aspects of my current research that focuses on dissecting the brain circuits that support core object recognition behavior (primates’ ability to categorize objects within hundreds of milliseconds) in non-human primates. On the one hand, my work empirically examines how well computational models of the primate ventral visual pathways embed knowledge of the visual brain function (e.g., Bashivan*, Kar*, DiCarlo, Science, 2019). On the other hand, my work has led to various functional and architectural insights that help improve such brain models. For instance, we have exposed the necessity of recurrent computations in primate core object recognition (Kar et al., Nature Neuroscience, 2019), one that is strikingly missing from most feedforward artificial neural network models. Specifically, we have observed that the primate ventral stream requires fast recurrent processing via ventrolateral PFC for robust core object recognition (Kar and DiCarlo, Neuron, 2021). In addition, I have been currently developing various chemogenetic strategies to causally target specific bidirectional neural circuits in the macaque brain during multiple object recognition tasks to further probe their relevance during this behavior. I plan to transform these data and insights into tangible progress in neuroscience via my collaboration with various computational groups and building improved brain models of object recognition. I hope to end the talk with a brief glimpse of some of my planned future work!
The neuroscience of color and what makes primates special
Among mammals, excellent color vision has evolved only in certain non-human primates. And yet, color is often assumed to be just a low-level stimulus feature with a modest role in encoding and recognizing objects. The rationale for this dogma is compelling: object recognition is excellent in grayscale images (consider black-and-white movies, where faces, places, objects, and story are readily apparent). In my talk I will discuss experiments in which we used color as a tool to uncover an organizational plan in inferior temporal cortex (parallel, multistage processing for places, faces, colors, and objects) and a visual-stimulus functional representation in prefrontal cortex (PFC). The discovery of an extensive network of color-biased domains within IT and PFC, regions implicated in high-level object vision and executive functions, compels a re-evaluation of the role of color in behavior. I will discuss behavioral studies prompted by the neurobiology that uncover a universal principle for color categorization across languages, the first systematic study of the color statistics of objects and a chromatic mechanism by which the brain may compute animacy, and a surprising paradoxical impact of memory on face color. Taken together, my talk will put forward the argument that color is not primarily for object recognition, but rather for the assessment of the likely behavioral relevance, or meaning, of the stuff we see.
The contribution of the dorsal visual pathway to perception and action
The human visual system enables us to recognize objects (e.g., this is a cup) and act upon them (e.g., grasp the cup) with astonishing ease and accuracy. For decades, it was widely accepted that these different functions rely on two separated cortical pathways. The ventral occipitotemporal pathway subserves object recognition, while the dorsal occipitoparietal pathway promotes visually guided actions. In my talk, I will discuss recent evidence from a series of neuropsychological, developmental and neuroimaging studies that were aimed to explore the nature of object representations in the dorsal pathway. The results from these studies highlight the plausible role of the dorsal pathway in object perception and reveal an interplay between shape representations derived by the two pathways. Together, these findings challenge the binary distinction between the two pathways and are consistent with the view that object recognition is not the sole product of ventral pathway computations, but instead relies on a distributed network of regions.
What does the primary visual cortex tell us about object recognition?
A computational explanation for domain specificity in the human brain
Many regions of the human brain conduct highly specific functions, such as recognizing faces, understanding language, and thinking about other people’s thoughts. Why might this domain specific organization be a good design strategy for brains, and what is the origin of domain specificity in the first place? In this talk, I will present recent work testing whether the segregation of face and object perception in human brains emerges naturally from an optimization for both tasks. We trained artificial neural networks on face and object recognition, and found that networks were able to perform both tasks well by spontaneously segregating them into distinct pathways. Critically, networks neither had prior knowledge nor any inductive bias about the tasks. Furthermore, networks optimized on tasks which apparently do not develop specialization in the human brain, such as food or cars, and object categorization showed less task segregation. These results suggest that functional segregation can spontaneously emerge without a task-specific bias, and that the domain-specific organization of the cortex may reflect a computational optimization for the real-world tasks humans solve.
Learning from the infant’s point of view
Learning depends on both the learning mechanism and the regularities in the training material, yet most research on human and machine learning focus on the discovering the mechanisms that underlie powerful learning. I will present evidence from our research focusing on the statistical structure of infant visual learning environments. The findings suggest that the statistical structure of those learning environments are not like those used in laboratory experiments on visual learning, in machine learning, or in our adult assumptions about how teach visual categories. The data derive from our use of head cameras and head-mounted eye trackers capturing FOV experiences in the home as well as in simulated home environments in the laboratory. The participants range from 1 month of age to 24 months. The observed statistical structure offers new insights into the developmental foundations of visual object recognition and suggest a computational rethinking of the problem of visual category formation. The observed environmental statistics also have direct implications for understanding the development of cortical visual systems.
Do better object recognition models improve the generalization gap in neural predictivity?
COSYNE 2022
Linking neural dynamics across macaque V4, IT, and PFC to trial-by-trial object recognition behavior
COSYNE 2022
Linking neural dynamics across macaque V4, IT, and PFC to trial-by-trial object recognition behavior
COSYNE 2022
Distinct roles of excitatory and inhibitory neurons in the macaque IT cortex in object recognition
COSYNE 2023
Spatial-frequency channels for object recognition by neural networks are twice as wide as those of humans
COSYNE 2023
Behavioral impacts of simulated microgravity on male mice: Locomotion, social interactions and memory in a novel object recognition task
FENS Forum 2024
Two distinct ways to form long-term object recognition memory during sleep and wakefulness
FENS Forum 2024
Evaluation of novel object recognition test results of rats injected with intracerebroventricular streptozocin to develop Alzheimer's disease models
FENS Forum 2024
Homecage-based unsupervised novel object recognition in mice
FENS Forum 2024
Interaction of sex and sleep on performance at the novel object recognition task in mice
FENS Forum 2024
Sex-dependent effects of voluntary physical exercise on object recognition memory restoration after traumatic brain injury in middle-aged rats
FENS Forum 2024
Unraveling the mechanisms underlying corticosterone-induced impairment in novel object recognition in mice
FENS Forum 2024
A virtual-reality task to investigate multisensory object recognition in mice
FENS Forum 2024