Tag Index

 AI safety (1)  cognitive control (1)  compositionality (1)  continual learning (4)  curriculum learning (5)  data imbalance (3)  diffusion models (1)  epidemic mitigation (1)  fairness (3)  fatigue (1)  landscape (7)  large language models (1)  lottery ticket hypothesis (1)  machine learning (1)  model collapse (1)  momentum (1)  neural networks (1)  optimal control (1)  optimisation (8)  reinforcement learning (1)  review (2)  spurious correlations (1)  statistical physics (1)  transfer learning (2)

 AI safety (1)

Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities

 cognitive control (1)

A meta-learning framework for rationalizing cognitive fatigue in neural systems

 compositionality (1)

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning

 continual learning (4)

Maslow's Hammer for Catastrophic Forgetting: Node Re-Use vs Node Activation
A meta-learning framework for rationalizing cognitive fatigue in neural systems
Optimal Protocols for Continual Learning via Statistical Physics and Control Theory
A Theory of Initialisation's Impact on Specialisation

 curriculum learning (5)

An Analytical Theory of Curriculum Learning in Teacher-Student Networks
RL Perceptron: Generalization Dynamics of Policy Learning in High Dimensions
Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning
Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks
Curriculum learning in humans and neural networks

 data imbalance (3)

Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD Training
Bias-inducing geometries: an exactly solvable data model with fairness implications
The Interplay of Data Structure and Imbalance in the Learning Dynamics of Diffusion Models

 diffusion models (1)

The Interplay of Data Structure and Imbalance in the Learning Dynamics of Diffusion Models

 epidemic mitigation (1)

Epidemic mitigation by statistical inference from contact tracing data

 fairness (3)

Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD Training
Bias-inducing geometries: an exactly solvable data model with fairness implications
The Interplay of Data Structure and Imbalance in the Learning Dynamics of Diffusion Models

 fatigue (1)

A meta-learning framework for rationalizing cognitive fatigue in neural systems

 landscape (7)

Passed & Spurious: Descent Algorithms and Local Minima in Spiked Matrix-Tensor Models
Who is Afraid of Big Bad Minima? Analysis of gradient-flow in spiked matrix-tensor models
Marvels and pitfalls of the Langevin algorithm in noisy high-dimensional inference
Thresholds of descending algorithms in inference problems
Complex Dynamics in Simple Neural Networks: Understanding Gradient Flow in Phase Retrieval
Optimization and Generalization of Shallow Neural Networks with Quadratic Activation Functions
Sharp description of local minima in the loss landscape of high-dimensional two-layer ReLU neural networks

 large language models (1)

Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities

 lottery ticket hypothesis (1)

Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks

 machine learning (1)

Thinking of Neural Networks Like a Physicist: The Statistical Physics of Machine Learning

 model collapse (1)

Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities

 momentum (1)

Analytical Study of Momentum-Based Acceleration Methods in Paradigmatic High-Dimensional Non-Convex Problems

 neural networks (1)

Sharp description of local minima in the loss landscape of high-dimensional two-layer ReLU neural networks

 optimal control (1)

Optimal Protocols for Continual Learning via Statistical Physics and Control Theory

 optimisation (8)

Passed & Spurious: Descent Algorithms and Local Minima in Spiked Matrix-Tensor Models
Who is Afraid of Big Bad Minima? Analysis of gradient-flow in spiked matrix-tensor models
Marvels and pitfalls of the Langevin algorithm in noisy high-dimensional inference
Thresholds of descending algorithms in inference problems
Complex Dynamics in Simple Neural Networks: Understanding Gradient Flow in Phase Retrieval
Optimization and Generalization of Shallow Neural Networks with Quadratic Activation Functions
Analytical Study of Momentum-Based Acceleration Methods in Paradigmatic High-Dimensional Non-Convex Problems
Sharp description of local minima in the loss landscape of high-dimensional two-layer ReLU neural networks

 reinforcement learning (1)

RL Perceptron: Generalization Dynamics of Policy Learning in High Dimensions

 review (2)

Thresholds of descending algorithms in inference problems
Thinking of Neural Networks Like a Physicist: The Statistical Physics of Machine Learning

 spurious correlations (1)

Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD Training

 statistical physics (1)

Thinking of Neural Networks Like a Physicist: The Statistical Physics of Machine Learning

 transfer learning (2)

Probing transfer learning with a model of synthetic correlated datasets
How to choose the right transfer learning protocol? A qualitative analysis in a controlled set-up