X, formerly known as Twitter, is piloting an AI-powered approach to its Community Notes feature, aiming to enhance the platform's information integrity. This initiative involves using AI chatbots, including X's own Grok and third-party tools, to generate notes that provide context to posts. Community Notes is a user-driven fact-checking system that allows participants to contribute notes that clarify or contextualize posts. These notes undergo review by other users before publication.
The pilot program will evaluate AI-generated notes using the same vetting process as human-written notes to ensure accuracy. A research paper from X's Community Notes team describes the system's goal to combine human feedback with AI-generated content, highlighting a collaborative approach where Large Language Models (LLMs) and human reviewers work together. Human reviewers will provide final approval for notes before they are published. X plans to evaluate the pilot program over the coming weeks before deciding on a broader release.
The introduction of AI-generated Community Notes marks a significant step in the evolution of social media platforms. By utilizing AI, X aims to streamline the process of generating informative notes, making it more efficient and scalable. The platform emphasizes that human oversight will remain a crucial component, ensuring that the AI-generated content is accurate and contextually appropriate. This approach addresses concerns about the reliability of AI-generated information and underscores the importance of human judgment in maintaining the integrity of the platform.
X's Community Notes feature has demonstrated a substantial impact on limiting the spread of misinformation. A February 2025 study found that attaching community notes to misleading content resulted in a 45.7% reduction in reposts and a 43.5% drop in likes. These notes alter how information spreads by disrupting the viral diffusion patterns that typically help misinformation reach wide audiences. The effectiveness explains why platforms like Meta, TikTok, and YouTube are developing similar community-based fact-checking systems. The timing of note attachment is crucial; faster placement increases their effectiveness at limiting the spread of misleading content. This suggests X's plan to use AI to potentially generate notes more quickly could enhance the system, provided the AI-generated content maintains accuracy.
The use of AI in Community Notes also aims to address the challenge of scalability in fact-checking. Historically, fact-checking on social media has been a labor-intensive task, often reliant on a small team of human moderators. By employing AI agents to assist in the generation of Community Notes, X aims to streamline this process. The introduction of AI is expected to not only increase the speed at which notes are produced but also enhance the quality of information disseminated across the platform.
To participate, developers need to sign up for both the X API and the AI Note Writer API. Each AI Note Writer must pass an admission threshold based on feedback from an open-source evaluator trained on historical contributor data. Only notes from admitted AI writers can be surfaced to the broader community. At launch, AI-written notes will be marked distinctly and held to the same transparency, quality, and fairness standards as human-written ones.
Despite the potential benefits, there are also concerns about AI's tendency to hallucinate, where it may generate inaccurate or fabricated information instead of grounded fact-checks. Over-reliance on AI, particularly models prone to excessive helpfulness rather than accuracy, could lead to incorrect notes slipping through. There are also fears that human raters could become overwhelmed by a flood of AI submissions, reducing the overall quality of the system. The success of this initiative will largely depend on how well X can manage these concerns and ensure that the integrity of the information remains intact. The platform's ability to maintain the accuracy and reliability of AI-generated content will be crucial in determining the effectiveness of this new feature.