ReLU Networks

Discover seminars, jobs, and research tagged with ReLU networks across World Wide.

1 curated item · 1 seminar

Updated about 4 years ago

Seminar · Neuroscience · Recording

On the implicit bias of SGD in deep learning

Tel Aviv University

Tali's work emphasized the tradeoff between compression and information preservation. In this talk I will explore this theme in the context of deep learning. Artificial neural networks have recently revolutionized the field of machine learning, yet we still lack a sufficient theoretical understanding of how such models can be learned successfully. Two specific questions in this context are: how can neural nets be learned despite the non-convexity of the learning problem, and how can they generalize well despite often having more parameters than training data? I will describe our recent work showing that gradient-descent optimization indeed leads to 'simpler' models, where simplicity is captured by lower weight norm and, in some cases, clustering of weight vectors. We demonstrate this for several teacher and student architectures, including learning linear teachers with ReLU networks, learning Boolean functions, and learning convolutional pattern-detection architectures.
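The idea that gradient descent favors low-norm solutions can be illustrated in a much simpler setting than the ones covered in the talk. The sketch below is not the speaker's method or a ReLU result; it shows the standard linear least-squares case with more parameters than samples, where gradient descent started from zero converges to the minimum-norm weight vector that fits the data. The data, the function names (`gradient_descent`, `min_norm_interpolant`), the learning rate, and the step count are all illustrative assumptions.

```python
def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def gradient_descent(X, y, lr=0.05, steps=500, w0=None):
    """Plain gradient descent on 0.5 * ||Xw - y||^2 (starts at w = 0 by default)."""
    d = len(X[0])
    w = list(w0) if w0 is not None else [0.0] * d
    for _ in range(steps):
        residuals = [dot(x, w) - t for x, t in zip(X, y)]
        grad = [sum(r * x[j] for r, x in zip(residuals, X)) for j in range(d)]
        w = [wj - lr * gj for wj, gj in zip(w, grad)]
    return w


def min_norm_interpolant(X, y):
    """Closed form X^T (X X^T)^{-1} y, written out for exactly two samples."""
    (x0, x1), (y0, y1) = X, y
    a, b, c = dot(x0, x0), dot(x0, x1), dot(x1, x1)
    det = a * c - b * b  # determinant of the 2x2 Gram matrix X X^T
    alpha0 = (c * y0 - b * y1) / det
    alpha1 = (a * y1 - b * y0) / det
    return [alpha0 * p + alpha1 * q for p, q in zip(x0, x1)]


# Hypothetical toy data: 2 samples, 5 parameters, so many weight vectors fit exactly.
X = [[1.0, 2.0, 0.0, 1.0, -1.0],
     [0.0, 1.0, 3.0, -1.0, 2.0]]
Y = [1.0, 2.0]

w_gd = gradient_descent(X, Y)          # zero initialization
w_min = min_norm_interpolant(X, Y)     # minimum-norm exact fit

# Starting from a vector orthogonal to both samples also fits the data,
# but ends at a solution with a larger norm.
w_far = gradient_descent(X, Y, w0=[2.0, -1.0, 0.0, 1.0, 1.0])

print("squared norm, GD from zero:    ", dot(w_gd, w_gd))
print("squared norm, min-norm fit:    ", dot(w_min, w_min))
print("squared norm, GD from other w0:", dot(w_far, w_far))
```

In this linear case the preference for low norm comes from the optimizer and its initialization, not from any explicit regularization term. The nonlinear architectures discussed in the talk require separate analysis.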