
arXiv:2605.25312v1 Announce Type: new Abstract: We introduce P1SCO, a dataset of social media comments collected from three distinct platforms, annotated according to ten social dimensions to capture the diversity of social interactions and perceptions. The dataset is carefully disaggregated to allow analysis at the level of individual comments, annotators, and platforms. In addition to the social dimension labels, we include rich metadata on the annotators, including demographics, Big Five personality profiles, and political affiliation. This combination of comment-level annotations and annot
The proliferation of social media and AI's increasing role in content analysis makes understanding nuanced social dimensions from public data critical now.
This dataset offers a more granular understanding of social interactions and perceptions, crucial for developing robust and ethically aligned AI systems.
The availability of this disaggregated and rich dataset allows for more sophisticated analysis of social dynamics and AI's interaction with them across different platforms and demographic groups.
- · AI ethics researchers
- · Social science researchers
- · NLP developers
- · Social media platforms
- · Developers of uncurated, biased AI models
Improved understanding of social dynamics in online interactions for AI development.
Development of more socially aware and less biased AI models due to better training data.
Enhanced AI capability to navigate complex social contexts, potentially leading to more effective and trusted human-AI collaboration.
This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.
Read at arXiv cs.CL