Extracting Affect Aggregates from Longitudinal Social Media Data with Temporal Adapters for Large Language Models

Authors: Georg Ahnert, Max Pellert, David Garcia, and Markus Strohmaier

Abstract

We propose temporally aligned Large Language Models (LLMs) as a tool for longitudinal analysis of social media data. We fine-tune Temporal Adapters for Llama 3 8B on full timelines from a panel of British Twitter users, and extract longitudinal aggregates of emotions and attitudes with established questionnaires. We validate our estimates against representative British survey data and find strong positive, significant correlations for several collective emotions. The obtained estimates are robust across multiple training seeds and prompt formulations, and in line with collective emotions extracted using a traditional classification model trained on labeled data. To the best of our knowledge, this is the first work to extend the analysis of affect in LLMs to a longitudinal setting through Temporal Adapters. Our work enables new approaches towards the longitudinal analysis of social media data.

🦙 Temporal Adapter Training

train_llama3_empiricalData.py trains Temporal Adapters from weekly splits of Twitter data and can be run with accelerate launch for distributed training.
train_llama3_syntheticMix.py trains Temporal Adapters from synthetically mixed (labeled) tweets and can be run with accelerate launch for distributed training.
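The weekly splitting that the empirical training script relies on can be pictured with a minimal sketch. This is not the repository's code: the helper name `split_by_week`, the `(date, text)` input format, and the choice of ISO calendar weeks are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import date


def split_by_week(tweets):
    """Group (date, text) pairs by ISO (year, week).

    Hypothetical helper: the actual scripts may define weeks differently.
    """
    weeks = defaultdict(list)
    for day, text in tweets:
        iso = day.isocalendar()  # (ISO year, ISO week, ISO weekday)
        weeks[(iso[0], iso[1])].append(text)
    return dict(weeks)


# Toy data standing in for tweets; real input would be user timelines.
tweets = [
    (date(2020, 3, 2), "first"),   # Monday of ISO week 10, 2020
    (date(2020, 3, 8), "second"),  # Sunday of the same ISO week
    (date(2020, 3, 9), "third"),   # Monday of ISO week 11
]
weekly = split_by_week(tweets)
```

Each resulting group would then serve as the training corpus for one Temporal Adapter, assuming one adapter per week.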

🤖 Survey Question Inference with Temporal Adapters

eval_llama3_empiricalData.py extracts answers to YouGov's Britain's Mood, Measured Weekly from LLMs equipped with Temporal Adapters, week by week.
eval_llama3_syntheticMix.py extracts answers to YouGov's Britain's Mood, Measured Weekly from LLMs equipped with Temporal Adapters trained on synthetically mixed data.
extract_answers.py implements a function for scoring survey answers based on token probabilities of causal language models.
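Scoring survey answers from token probabilities can be sketched as follows. This is an illustrative assumption about the general technique, not the function in extract_answers.py: the name `score_answers` and its dictionary input are hypothetical, and the log-probabilities are assumed to have been read off the model's next-token distribution after the survey prompt.

```python
import math


def score_answers(option_logprobs):
    """Renormalize answer-option log-probabilities into a distribution.

    Hypothetical helper: option_logprobs maps each answer option to the
    log-probability a causal language model assigns to that option's token.
    """
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(option_logprobs.values())
    weights = {k: math.exp(v - peak) for k, v in option_logprobs.items()}
    total = sum(weights.values())
    # Probability mass on non-option tokens is discarded by this step.
    return {k: w / total for k, w in weights.items()}


# Toy values: the model puts 0.3 on "yes" and 0.1 on "no"; the remaining
# mass sits on tokens that are not valid answers.
example = {"yes": math.log(0.3), "no": math.log(0.1)}
probs = score_answers(example)
```

Restricting the distribution to the valid answer options makes scores comparable across weeks even when the model spreads probability over unrelated tokens.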

dess-mannheim/temporal-adapters
