The DSA Transparency Database: Auditing Self-reported Moderation Actions by Social Media

Read original: arXiv:2312.10269 - Published 8/2/2024 by Amaury Trujillo, Tiziano Fagni, Stefano Cresci

The DSA Transparency Database: Auditing Self-reported Moderation Actions by Social Media

Overview

The paper presents the DSA Transparency Database, a project that audits self-reported moderation actions by social media platforms.
The database aims to provide independent verification and analysis of platforms' claims about their content moderation practices.
The researchers collect and analyze data from platforms' transparency reports to assess the accuracy and completeness of their disclosures.

Plain English Explanation

The paper discusses the DSA Transparency Database, a project that examines how social media platforms report on their content moderation efforts. Social media companies often publish transparency reports that explain how they remove harmful or illegal content from their platforms. However, there are concerns that these reports may not tell the full story.

The researchers behind the DSA Transparency Database collect and analyze the data in these transparency reports. Their goal is to independently verify whether the platforms are accurately representing their content moderation practices. By cross-checking the claims in the transparency reports, the researchers hope to provide a more complete and reliable picture of how social media companies actually moderate user-generated content.

Technical Explanation

The paper describes the DSA Transparency Database, a project that collects and analyzes the content moderation transparency reports published by major social media platforms. The researchers aim to independently audit these self-reported disclosures in order to assess their accuracy and completeness.

The researchers gather data from the transparency reports, including statistics on content removals, account suspensions, and other enforcement actions taken by the platforms. They then perform comparative analyses to identify discrepancies or omissions in the platforms' reporting.

For example, the researchers may cross-reference the platforms' reported data on content removals with independent datasets on harmful online content. By comparing the two sources, they can determine whether the platforms are accurately reflecting the full scope of their content moderation efforts.

The goal of the DSA Transparency Database is to provide a more transparent and accountable view of social media companies' content policies and enforcement practices. By auditing the self-reported data, the researchers aim to identify any gaps or inconsistencies that may undermine the platforms' claimed commitment to community guidelines and content moderation.

Critical Analysis

The paper acknowledges several limitations and challenges in the DSA Transparency Database project. One key issue is the reliance on self-reported data from the platforms, which may be incomplete or inaccurate. The researchers note that their analyses are constrained by the information that platforms choose to disclose in their transparency reports.

Additionally, the researchers highlight the difficulty of obtaining comprehensive and reliable data on online content removals and account suspensions. Much of this information is not publicly available, and platforms may have incentives to underreport certain enforcement actions.

The paper also discusses the potential for selection bias, as the researchers can only analyze the data that is actually included in the transparency reports. There may be important moderation actions or content takedowns that are not captured in the reports.

Despite these limitations, the researchers argue that the DSA Transparency Database represents an important step towards greater accountability and transparency in social media content moderation. By providing independent verification and analysis, the project aims to hold platforms more accountable for their self-reported claims and disclosures.

Conclusion

The DSA Transparency Database is a novel initiative that seeks to audit the content moderation practices of major social media platforms. By collecting and analyzing the data in platforms' own transparency reports, the researchers hope to provide a more comprehensive and reliable picture of how these companies enforce their community guidelines and policies.

While the project faces some methodological challenges, the researchers believe it is a crucial step towards improving transparency and accountability in the digital services industry. By shining a light on any gaps or inconsistencies in platforms' self-reported data, the DSA Transparency Database could lead to more robust and trustworthy content moderation systems that better protect online users.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

The DSA Transparency Database: Auditing Self-reported Moderation Actions by Social Media

Amaury Trujillo, Tiziano Fagni, Stefano Cresci

Since September 2023, the Digital Services Act (DSA) obliges large online platforms to submit detailed data on each moderation action they take within the European Union (EU) to the DSA Transparency Database. From its inception, this centralized database has sparked scholarly interest as an unprecedented and potentially unique trove of data on real-world online moderation. Here, we thoroughly analyze all 353.12M records submitted by the eight largest social media platforms in the EU during the first 100 days of the database. Specifically, we conduct a platform-wise comparative study of their: volume of moderation actions, grounds for decision, types of applied restrictions, types of moderated content, timeliness in undertaking and submitting moderation actions, and use of automation. Furthermore, we systematically cross-check the contents of the database with the platforms' own transparency reports. Our analyses reveal that (i) the platforms adhered only in part to the philosophy and structure of the database, (ii) the structure of the database is partially inadequate for the platforms' reporting needs, (iii) the platforms exhibited substantial differences in their moderation actions, (iv) a remarkable fraction of the database data is inconsistent, (v) the platform X (formerly Twitter) presents the most inconsistencies. Our findings have far-reaching implications for policymakers and scholars across diverse disciplines. They offer guidance for future regulations that cater to the reporting needs of online platforms in general, but also highlight opportunities to improve and refine the database itself.

8/2/2024

Automated Transparency: A Legal and Empirical Analysis of the Digital Services Act Transparency Database

Rishabh Kaushal, Jacob van de Kerkhof, Catalina Goanta, Gerasimos Spanakis, Adriana Iamnitchi

The Digital Services Act (DSA) is a much awaited platforms liability reform in the European Union that was adopted on 1 November 2022 with the ambition to set a global example in terms of accountability and transparency. Among other obligations, the DSA emphasizes the need for online platforms to report on their content moderation decisions (`statements of reasons' - SoRs), which is a novel transparency mechanism we refer to as automated transparency in this study. SoRs are currently made available in the DSA Transparency Database, launched by the European Commission in September 2023. The DSA Transparency Database marks a historical achievement in platform governance, and allows investigations about the actual transparency gains, both at structure level as well as at the level of platform compliance. This study aims to understand whether the Transparency Database helps the DSA to live up to its transparency promises. We use legal and empirical arguments to show that while there are some transparency gains, compliance remains problematic, as the current database structure allows for a lot of discretion from platforms in terms of transparency practices. In our empirical study, we analyze a representative sample of the Transparency Database (131m SoRs) submitted in November 2023, to characterise and evaluate platform content moderation practices.

5/6/2024

🎯

Operationalizing content moderation accuracy in the Digital Services Act

Johnny Tian-Zheng Wei, Frederike Zufall, Robin Jia

The Digital Services Act, recently adopted by the EU, requires social media platforms to report the accuracy of their automated content moderation systems. The colloquial term is vague, or open-textured -- the literal accuracy (number of correct predictions divided by the total) is not suitable for problems with large class imbalance, and the ground truth and dataset to measure accuracy against is unspecified. Without further specification, the regulatory requirement allows for deficient reporting. In this interdisciplinary work, we operationalize accuracy reporting by refining legal concepts and relating them to technical implementation. We start by elucidating the legislative purpose of the Act to legally justify an interpretation of accuracy as precision and recall. These metrics remain informative in class imbalanced settings, and reflect the proportional balancing of Fundamental Rights of the EU Charter. We then focus on the estimation of recall, as its naive estimation can incur extremely high annotation costs and disproportionately interfere with the platform's right to conduct business. Through a simulation study, we show that recall can be efficiently estimated using stratified sampling with trained classifiers, and provide concrete recommendations for its application. Finally, we present a case study of recall reporting for a subset of Reddit under the Act. Based on the language in the Act, we identify a number of ways recall could be reported due to underspecification. We report on one possibility using our improved estimator, and discuss the implications and areas for further legal clarification.

6/4/2024

Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services

David Hartmann, Amin Oueslati, Dimitri Staufer

Online platforms face the challenge of moderating an ever-increasing volume of content, including harmful hate speech. In the absence of clear legal definitions and a lack of transparency regarding the role of algorithms in shaping decisions on content moderation, there is a critical need for external accountability. Our study contributes to filling this gap by systematically evaluating four leading cloud-based content moderation services through a third-party audit, highlighting issues such as biases against minorities and vulnerable groups that may arise through over-reliance on these services. Using a black-box audit approach and four benchmark data sets, we measure performance in explicit and implicit hate speech detection as well as counterfactual fairness through perturbation sensitivity analysis and present disparities in performance for certain target identity groups and data sets. Our analysis reveals that all services had difficulties detecting implicit hate speech, which relies on more subtle and codified messages. Moreover, our results point to the need to remove group-specific bias. It seems that biases towards some groups, such as Women, have been mostly rectified, while biases towards other groups, such as LGBTQ+ and PoC remain.

6/21/2024