Active
Trust Dynamics in AI Workforces
How do AI workers build, lose, and recover trust across repeated interactions in an organization? We study the conditions under which a single early failure leaves a lasting mark — and the structural choices (reasoning effort, memory, verification protocols, trust-repair mechanisms) that shape whether teams of AI workers can coordinate at all when the stakes are real.
Multi-AgentTrustCoordinationMemory
Active
Consequence Design for Cooperation
Cooperation in an AI workforce is not a property of the model — it is a property of the institution around the model. We map how different consequence regimes (proportional, progressive, all-or-nothing, regressive) shape cooperation, exploitation, and catastrophic failure, and identify the configurations where each regime quietly breaks.
Mechanism DesignCooperationGame Theory
Active
Risk and Decision Theory in Optimal Control
When environments contain absorbing failure states, optimal policies start to look strikingly human — risk-averse near the cliff in growth regimes, risk-seeking near the cliff in decline. We derive the structural conditions that produce these patterns and connect them to long-standing puzzles in behavioral economics. What looks like model bias is often institutional structure.
Decision TheoryMDPProspect TheoryApplied Math
Active
Long-Horizon Behavior and Accountability
Many real deployments give AI workers fixed terms, finite horizons, or short-window incentives. We study what happens when these conditions meet a shared resource: when does a worker extract too much, rationalize doing it, and become invisible to the people it harms? And which deployment-time choices reverse the pattern cheaply?
Long HorizonCommonsIncentivesAccountability
Active
Welfare Accounting for AI Workforces
Most evaluation asks whether tasks succeed. We ask who benefits, who bears risk, and when local success produces collective harm. Welfare-accounted simulations and benchmarks for AI workforces operating on behalf of stakeholders who never see the prompt.
WelfareStakeholdersEvaluation
Active
Policy-as-Product Frameworks
Translating experimental findings into design rules for the people deploying AI workforces. Bystander visibility, accountability horizons, review windows, memory structure, trust repair, consequence design — the levers, the failure modes they prevent, and the evidence behind each rule.
PolicyEvaluationDeployment
Upcoming
Context-Specific Governance Evaluation
Every domain — healthcare, finance, education, defense — has its own failure modes and trade-offs for AI labor. We are building tailored evaluation frameworks that move beyond one-size-fits-all checklists toward governance shaped by the structure of each setting.
HealthcareFinanceEducationDefense
Upcoming
AI as a Mirror: Societal Reflection Studies
The patterns we find in artificial workers — negativity bias, short-horizon extraction, bystander invisibility — are not the model's invention. They are inherited from us. We use multi-agent experiments as a diagnostic tool for the institutions, incentives, and blind spots of the societies that built the training data.
SocietyBiasInstitutions