Ethics - Government AI Control

A National Policy Framework for Artificial Intelligence: Legislative Recommendations

On: March 20, 2026

This White House document presents legislative recommendations for AI governance across issues including child safety, free speech, intellectual property, and protections for communities. It frames ethics at the public-policy level by identifying areas where legal standards and accountability structures may be needed for AI deployment. The document is materially distinct from technical safety work because it focuses on governance mechanisms and legislative action.

Sources:

A National Policy Framework for Artificial Intelligence: Legislative Recommendations

The Anthropic Institute

On: March 19, 2026

Anthropic announced an institute focused on the societal impacts of powerful AI, including research on AI values and how systems behave in the wild. The launch signals an effort to build dedicated institutional capacity around ethical and social questions, not just model capability and safety engineering. Its relevance to ethics lies in connecting technical system behavior with broader public-interest research.

Sources:

The Anthropic Institute

Frontier Safety Roadmap

On: February 19, 2026

This roadmap outlines Anthropic’s priorities for frontier AI safety, including oversight, governance, and security measures for advanced systems. While broader than ethics alone, it directly addresses ethical deployment by describing institutional and technical controls meant to reduce harm from increasingly capable models. It is useful as a policy-oriented reference for how a major lab frames responsible development at the frontier.

Sources:

Frontier Safety Roadmap

MoralityGym: A Benchmark for Evaluating Hierarchical Moral Alignment in Sequential Decision-Making Agents

On: February 13, 2026

This paper introduces MoralityGym, a benchmark for testing how well AI agents handle moral tradeoffs across sequential decision-making tasks rather than isolated prompts. It focuses on hierarchical moral alignment, where choices unfold over time in ethical-dilemma environments and can reveal failures that static evaluations may miss. The work is relevant to AI ethics because it offers a structured way to measure whether agent behavior stays aligned under longer-horizon decisions.

Sources:

MoralityGym: A Benchmark for Evaluating Hierarchical Moral Alignment in Sequential Decision-Making Agents

Values in the Wild

On: February 01, 2026

Anthropic’s paper studies how AI systems express values in real-world use and presents a taxonomy built from thousands of observed value expressions. Rather than defining values only in theory, it examines the patterns that emerge when systems interact with users in practice. The result is a grounded view of how AI values appear outside the lab, which can inform evaluation, governance, and product design.

Sources:

Values in the Wild