AI Alignment Forum

LLM chatbots have ~half of the kinds of "consciousness" that humans believe in. Humans should avoid going crazy about that.
Intrinsic Power-Seeking: AI Might Seek Power for Power’s Sake
Training AI agents to solve hard problems could lead to Scheming
Why imperfect adversarial robustness doesn't doom AI control
Cross-context abduction: LLMs make inferences about procedural training data leveraging declarative facts in earlier training data
Which evals resources would be good?
Win/continue/lose scenarios and execute/replace/audit protocols
Evolutionary prompt optimization for SAE feature visualization
AXRP Episode 38.0 - Zhijing Jin on LLMs, Causality, and Multi-Agent Systems
o1 is a bad idea


Copyright © 2023 – All rights reserved.
