VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform

Lu, Xingyu; Zhang, Tianke; Meng, Chang; Wang, Xiaobei; Wang, Jinpeng; Zhang, YiFan; Tang, Shisong; Liu, Changyi; Ding, Haojie; Jiang, Kaiyu; Tang, Kaiyu; Wen, Bin; Zheng, Hai-Tao; Yang, Fan; Gao, Tingting; Zhang, Di; Gai, Kun

Computer Science > Social and Information Networks

arXiv:2504.14904 (cs)

[Submitted on 21 Apr 2025]

Title:VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform

Authors:Xingyu Lu, Tianke Zhang, Chang Meng, Xiaobei Wang, Jinpeng Wang, YiFan Zhang, Shisong Tang, Changyi Liu, Haojie Ding, Kaiyu Jiang, Kaiyu Tang, Bin Wen, Hai-Tao Zheng, Fan Yang, Tingting Gao, Di Zhang, Kun Gai

View PDF HTML (experimental)

Abstract:Exponentially growing short video platforms (SVPs) face significant challenges in moderating content detrimental to users' mental health, particularly for minors. The dissemination of such content on SVPs can lead to catastrophic societal consequences. Although substantial efforts have been dedicated to moderating such content, existing methods suffer from critical limitations: (1) Manual review is prone to human bias and incurs high operational costs. (2) Automated methods, though efficient, lack nuanced content understanding, resulting in lower accuracy. (3) Industrial moderation regulations struggle to adapt to rapidly evolving trends due to long update cycles. In this paper, we annotate the first SVP content moderation benchmark with authentic user/reviewer feedback to fill the absence of benchmark in this field. Then we evaluate various methods on the benchmark to verify the existence of the aforementioned limitations. We further propose our common-law content moderation framework named KuaiMod to address these challenges. KuaiMod consists of three components: training data construction, offline adaptation, and online deployment & refinement. Leveraging large vision language model (VLM) and Chain-of-Thought (CoT) reasoning, KuaiMod adequately models video toxicity based on sparse user feedback and fosters dynamic moderation policy with rapid update speed and high accuracy. Offline experiments and large-scale online A/B test demonstrates the superiority of KuaiMod: KuaiMod achieves the best moderation performance on our benchmark. The deployment of KuaiMod reduces the user reporting rate by 20% and its application in video recommendation increases both Daily Active User (DAU) and APP Usage Time (AUT) on several Kuaishou scenarios. We have open-sourced our benchmark at this https URL.

Comments:	20 pages, 6 figures
Subjects:	Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
Cite as:	arXiv:2504.14904 [cs.SI]
	(or arXiv:2504.14904v1 [cs.SI] for this version)
	https://doi.org/10.48550/arXiv.2504.14904

Submission history

From: Xingyu Lu [view email]
[v1] Mon, 21 Apr 2025 07:20:19 UTC (21,504 KB)

Computer Science > Social and Information Networks

Title:VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Social and Information Networks

Title:VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators