🍈 Zettelkasten

❯

Machine Learning

❯

AI Sandbagging

Oct 03, 20251 min read

machine_learning
security
ai_safety
philosophy

Situtations in where AI underperforms during an evaluation to appear safer and lses capable than it truly is.

Graph View

Backlinks

AI Safety

Created with Quartz v4.4.0 © 2025

GitHub
Discord Community