ClimateChat-300K: A Multi-Modal Facebook Dataset for Understanding Diverse Perspectives in Climate Communication

arXiv:2605.23326v1. Abstract: We present ClimateChat-300K, a large-scale dataset of 299,329 public Facebook posts about climate change collected between May 2020 and May 2024 through the CrowdTangle platform. The dataset contains 41 metadata features including post content, engagement metrics, and page attributes, covering material from more than 26,000 global pages. Each post includes rich contextual information such as language, timestamp, page category, and interaction counts, enabling comprehensive analyses of public discourse around climate communication. Using topic mod
The proliferation of social media data combined with advancements in AI for natural language processing makes the creation and analysis of such datasets feasible and timely.
This dataset provides a critical resource for understanding public discourse on climate change, which is essential for effective climate communication strategies and policy development.
Researchers and policymakers now have a large, multi-modal dataset to analyze diverse perspectives on climate change, moving beyond anecdotal evidence to data-driven insights.
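As a rough illustration of the kind of data-driven analysis this enables, the sketch below averages interaction counts by page category and by language. The records and the field names (`page_category`, `language`, `interactions`) are hypothetical stand-ins for the metadata the abstract mentions; they are not the dataset's actual schema, and the numbers are made up for the example.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records: field names and values are illustrative only,
# not the actual ClimateChat-300K schema or data.
posts = [
    {"page_category": "NEWS_SITE", "language": "en", "interactions": 120},
    {"page_category": "NEWS_SITE", "language": "en", "interactions": 80},
    {"page_category": "NGO", "language": "es", "interactions": 40},
    {"page_category": "NGO", "language": "en", "interactions": 60},
]


def mean_interactions_by(records, key):
    """Return the average interaction count for each distinct value of `key`."""
    groups = defaultdict(list)
    for record in records:
        groups[record[key]].append(record["interactions"])
    return {value: mean(counts) for value, counts in groups.items()}


by_category = mean_interactions_by(posts, "page_category")
by_language = mean_interactions_by(posts, "language")
```

The same grouping could be applied to any categorical attribute in the real data once its column names are known.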
- Climate scientists
- Social media researchers
- Data scientists
- ESG investors
- Platforms with opaque data policies
- Organizations relying solely on traditional polling
Improved understanding of public sentiment and misinformation vectors related to climate change.
More targeted and effective climate communication campaigns and policy initiatives informed by data.
Potential for AI agents to dynamically adapt climate narratives to specific audience segments, influencing public opinion at scale.
Read at arXiv cs.CL