Amazon Abandons AI Usage Rankings After Employees Game the System

Amazon has encountered an unexpected challenge following its aggressive push to integrate artificial intelligence throughout its corporate operations. According to a recent report by the Financial Times, the tech giant was forced to discontinue an internal ranking system that tracked employee usage of its proprietary AI platform, Kiro. The decision came after workers began deliberately burning through AI tokens simply to climb the company’s leaderboard, turning a productivity tool into a competitive game with questionable benefits.

The situation highlights a growing tension in the corporate world between encouraging AI adoption and maintaining genuine productivity gains. Amazon had implemented the ranking system as part of a broader initiative to accelerate AI integration across its workforce, hoping that gamification would motivate employees to embrace the new technology. Instead, the company discovered that metrics designed to measure engagement were being manipulated, with staff members running unnecessary queries and consuming computational resources without any meaningful work output.

This development comes at a particularly interesting time for Amazon, which has been investing billions of dollars in artificial intelligence infrastructure and development. The company has positioned itself as a major player in the AI race, competing directly with Microsoft, Google, and other tech giants for dominance in the rapidly evolving market. Amazon Web Services, the company’s cloud computing division, has been aggressively marketing AI tools to enterprise customers while simultaneously rolling out internal AI solutions to boost employee productivity. The Kiro platform was intended to be a showcase of Amazon’s commitment to AI-first operations.

Industry analysts have noted that Amazon’s experience reflects a common pitfall in corporate AI adoption strategies. When companies tie incentives directly to usage metrics rather than outcomes, they risk creating perverse incentives that undermine the very goals they seek to achieve. Dr. Sarah Mitchell, a technology policy researcher at Stanford University, has previously warned that “measuring AI adoption by consumption rather than impact is like judging a diet by how much food you eat rather than your health outcomes.” This phenomenon, sometimes called “metric gaming” or “Goodhart’s Law” in action, occurs when a measure becomes a target and ceases to be a good measure.

The history of corporate gamification offers numerous cautionary tales. Wells Fargo’s infamous sales quota scandal, where employees opened millions of unauthorized accounts to meet aggressive targets, stands as a stark reminder of how misaligned incentives can lead to destructive behaviors. While Amazon’s AI leaderboard issue is far less severe, it demonstrates similar underlying dynamics. Employees, when faced with metrics that affect their standing or perceived performance, will often optimize for the metric itself rather than the intended outcome.

Amazon’s workforce has historically operated under intense performance pressure, with the company’s data-driven management culture being both praised for efficiency and criticized for creating stressful working conditions. The AI ranking system appears to have added another layer of competition to an already high-pressure environment. Some employees reportedly felt compelled to participate in the token-burning behavior simply to avoid appearing as laggards in AI adoption, regardless of whether the technology actually improved their work.

The broader implications of this incident extend beyond Amazon’s internal operations. As companies worldwide rush to implement AI solutions, many are grappling with how to measure success and encourage adoption without creating counterproductive incentives. Microsoft, Google, and countless other organizations have launched similar initiatives to embed AI into daily workflows, and they will likely be watching Amazon’s experience closely. The lesson appears clear: sustainable AI adoption requires focusing on genuine productivity improvements and employee empowerment rather than simple usage statistics.

Moving forward, Amazon will need to develop more sophisticated approaches to measuring AI’s impact on its operations. Industry experts suggest that successful AI integration programs should focus on qualitative assessments, workflow improvements, and actual business outcomes rather than raw consumption metrics. The company has not publicly commented on what replacement system, if any, it plans to implement. However, this episode serves as a valuable case study for the entire technology industry as it navigates the complex transition to AI-augmented workplaces.