NDIF: AN OPEN TOOL ECOSYSTEM FOR PROBING REPRESENTATIONS AND CIRCUITS IN DEEP LEARNING MODELS
Northeastern University
Presentation
Date TBA
Event Information
Poster Board
PS05-09AM-035
Poster
View posterAbstract
Despite advances in high-density electrophysiology and single-cell neuroimaging, recording and modulating full-brain activity at a fine-grained level remains unfeasible. Artificial neural networks (ANNs), such as LLMs, can serve as in silico alternatives for measuring cognitive-like behaviors using neuro-inspired methods, including activation recording, targeted perturbations, and representational analysis. However, most ANN studies focus on small models due to computational constraints, leaving the inner workings of capable frontier systems largely unexplored. The National Deep Inference Fabric (NDIF) bridges this gap by providing open tools and infrastructure for mechanistic analyses of large-scale open-source ANNs.
The NDIF ecosystem comprises three core tools. NNsight is a low-level package for recording and intervening on internal model activations, enabling causal experiments analogous to lesion studies and optogenetic manipulations. NNterp standardizes interpretability techniques across model architectures for reproducible analyses at scale. Finally, Workbench is our interpretability research platform for rapid exploration of model behaviors and AI pedagogy.
We conduct a systematic survey of 184 recent interpretability studies, finding a 40% performance gap on the Massive Multitask Language Understanding (MMLU) benchmark between commonly analyzed models and frontier systems. NDIF addresses this limitation by allowing researchers to access and conduct experiments directly on multi-billion-parameter models available on remote high-performance computing resources, enabling large-scale analyses without dedicated computational infrastructure.
Research using NDIF has already yielded insights into neuro-related questions such as multilingual concept encoding, Theory of Mind capabilities, and sentence-processing mechanisms in LLMs. We invite collaborations from the computational neuroscience community to explore parallels between artificial and biological neural computation.
Recommended posters
IN-DEPTH METADATA ANNOTATION WORKFLOW FOR FAIR SCIENCE AND MACHINE-ACTIONABLE ANALYSES OF COMPLEX EXPERIMENTAL NEUROSCIENCE DATA
Alix Bonard, Laura Morel, Aree Witoelar, Eivind Hennestad, Lyuba Zehl, Trygve B. Leergaard, Andrew P. Davison
A DATA-DRIVEN BIOPHYSICAL FRAMEWORK BRIDGING NEUROMODULATION TO MESOSCALE DYNAMICS
Enia Ardid, Kaleb Belay, Javier Alegre Cortes, Nikolas Karalis
THE STATE OF NEUROSCIENCE: MAPPING RESEARCH TRENDS AND COMMUNITY PERSPECTIVES ACROSS A RAPIDLY EVOLVING FIELD
Emily Singer, Panos Bozelos, Tim Vogels, Moritz Stefaner, Dennis Vasquez Montes, Kristin Ozelli, Staff The Transmitter
DRIADA: BRIDGING SINGLE-NEURON SELECTIVITY AND POPULATION GEOMETRY IN BEHAVING MICE
Nikita Pospelov, Olga Ivashkina, Ksenia Toropova, Viktor Plusnin, Olga Rogozhnikova, Anna Ivanova, Konstantin Anokhin
A HEAD-MOUNTED MINIATURIZED LIGHT-FIELD MICROSCOPE (MINILFMV2) FOR LARGE-SCALE VOLUMETRIC IMAGING OF NEURAL DYNAMICS DURING NATURAL BEHAVIOR
Tobias Nöbauer, Hao Li, Hossein Sarafraz, Alipasha Vaziri
SIMILARLY NONIDEAL: SHARED STRATEGIES IN HUMAN AND ARTIFICIAL VISUAL INFERENCE
Daniele Tirinnanzi, Rudy Skerk, Jean Barbier, Eugenio Piasini