

Title: Using sequences of life-events to predict human lives
Abstract: Here we represent human lives in a way that shares structural similarity to language, and we exploit this similarity to adapt natural language processing techniques to examine the evolution and predictability of human lives based on detailed event sequences. We do this by drawing on a comprehensive registry dataset, which is available for Denmark across several years, and that includes information about life-events related to health, education, occupation, income, address and working hours, recorded with day-to-day resolution. We create embeddings of life-events in a single vector space, showing that this embedding space is robust and highly structured. Our models allow us to predict diverse outcomes ranging from early mortality to personality nuances, outperforming state-of-the-art models by a wide margin. Using methods for interpreting deep learning models, we probe the algorithm to understand the factors that enable our predictions. Our framework allows researchers to discover potential mechanisms that impact life outcomes as well as the associated possibilities for personalized interventions. I will also present recent advances from the stream of research associated with the work described above.
Bio: Sune Lehmann is a Professor of Complexity and Network Science at DTU Compute, Technical University of Denmark and a Professor of Data Science at the Center for Social Data Science, University of Copenhagen. His work focuses on quantitative understanding of social systems based on massive data sets. A physicist by training, his research draws on approaches from the physics of complex systems, machine learning, and statistical analysis. Sune works on large-scale behavioral data and while his primary focus is on modeling complex networks, his research has made substantial contributions on topics such as human mobility, sleep, academic performance, complex contagion, epidemic spreading, and behavior on online social networks. He is a member of the Royal Danish Academy of Sciences and Letters and a chief scientist at the Danish National Center for AI in Society (CAISA)