Talk on June 3: David Bamman (UC Berkeley) on “Measuring Representation and Linguistic Variation in Hollywood”

2025/05/16

On Tuesday, June 3, 2025, we cordially invite you to the guest lecture by David Bamman (University of California, Berkeley). The lecture entitled “Measuring Representation and Linguistic Variation in Hollywood” starts at 18:00, in lecture theatre S3|13 30 (Residenzschloss). Below you will find further information about the person and the lecture.

Measuring Representation and Linguistic Variation in Hollywood

Movies are a massively popular and influential form of media, but their computational study at scale has largely been limited due to the availability of data. In this talk, I'll discuss our efforts to build a collection of 2,307 films representing the top 50 movies by U.S. box office over the period 1980 to 2022, along with award nominees. Building this collection allows us to carry out several large-scale computational studies of film; I'll discuss our work measuring changing patterns in the representation of gender and race/ethnicity over the past 43 years (where we see an increase in diversity over the past decade) and in leveraging it to model variation in emotional performances and choice of adverbial intensifiers over both narrative and historical time. This work illustrates a new frontier of the data-driven analysis of film at a large scale.

David Bamman is an associate professor in the School of Information at UC Berkeley, where he works in the areas of natural language processing and cultural analytics, applying NLP and machine learning to empirical questions in the humanities and social sciences. His research focuses on improving the performance of NLP for underserved domains like literature (including LitBank and BookNLP) and exploring the affordances of empirical methods for the study of literature and culture. Before Berkeley, he received his PhD in the School of Computer Science at Carnegie Mellon University and was a senior researcher at the Perseus Project of Tufts University.