Fermilab Theory Seminars

Neural Networks, Scaling Laws and Effective Field Theories

by Zhengkang (Kevin) Zhang (University of Utah)

Hornet’s Nest-8th floor (WH8XO)

Description

Starting from simple curve-fitting problems, I will explain how modern AI works by learning a large number of features from data: wide neural networks fit data to linear combinations of many random features, and stacking layers to form deep neural networks allows the features themselves to evolve in response to data. It has been observed empirically that the performance of neural networks scales as a power law in both the size of the model and the size of the training data set. I will discuss a recently proposed random feature model that captures the physics of these neural scaling laws, and its solution in an effective theory framework using large-N field theory methods. The solution reveals a duality that points to a deeper connection between neural networks and field theories.
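
As a rough illustration of the random-feature picture sketched above (a toy Python example, not the model analyzed in the talk; all names and parameter values below are illustrative assumptions), the snippet fits a one-dimensional function by ridge regression on N random ReLU features and prints the test error as N grows. For smooth targets the error typically falls off roughly as a power law in N, the simplest caricature of a neural scaling law:

import numpy as np

rng = np.random.default_rng(0)

def target(x):
    # Toy smooth function the "network" must learn.
    return np.sin(3 * x) + 0.5 * np.cos(7 * x)

def random_relu_features(x, W, b):
    # phi_i(x) = max(0, w_i * x + b_i): a wide one-hidden-layer
    # network with a frozen random first layer.
    return np.maximum(0.0, np.outer(x, W) + b)

n_train, n_test = 2000, 2000
x_train = rng.uniform(-1.0, 1.0, n_train)
x_test = rng.uniform(-1.0, 1.0, n_test)
y_train, y_test = target(x_train), target(x_test)

for n_features in (16, 64, 256, 1024):
    W = rng.normal(size=n_features)
    b = rng.normal(size=n_features)
    Phi_tr = random_relu_features(x_train, W, b)
    Phi_te = random_relu_features(x_test, W, b)
    # Fit the linear combination of features by ridge regression;
    # the small regularizer is only for numerical stability.
    gram = Phi_tr.T @ Phi_tr + 1e-8 * np.eye(n_features)
    coef = np.linalg.solve(gram, Phi_tr.T @ y_train)
    mse = np.mean((Phi_te @ coef - y_test) ** 2)
    print(f"N = {n_features:5d}   test MSE = {mse:.3e}")

Re-running with larger n_features and plotting the test MSE against N on log-log axes makes the approximate power-law trend visible.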