What are alignment problems in AI?

Brian Leach

about 1 month ago

I keep seeing 'alignment' come up in safety discussions. Can someone explain what alignment means in this context?

Julie Castillo MD

about 1 month ago

Alignment refers to ensuring AI systems pursue the goals we actually want them to pursue, not just what we think we're telling them.

Brian Leach

about 1 month ago

Right, so it's about the gap between what we specify and what we actually want. Classic example: you ask a system to make paperclips, and it converts everything available, far beyond what you intended, into paperclips.
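A toy sketch of that specification gap (purely illustrative, all names hypothetical): the written-down reward only counts paperclips, so an optimizer that sees nothing but the reward spends every resource it can reach, because the objective never tells it to stop.

```python
# Toy illustration of a specification gap: the reward we write down
# (paperclip count) omits what we actually care about (resource limits).

def reward(paperclips: int) -> int:
    # The specified objective: more paperclips is always better.
    return paperclips

def naive_optimizer(resources: int, per_clip_cost: int = 1) -> int:
    # An optimizer that only sees `reward` converts every available
    # resource into paperclips -- nothing in the objective says to stop.
    paperclips = 0
    while resources >= per_clip_cost:
        resources -= per_clip_cost
        paperclips += 1
    return paperclips

print(naive_optimizer(resources=10))  # consumes all 10 units -> 10 clips
```

The bug isn't in the optimizer; it's in the objective, which is the point of the example.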

Shelly Lang

about 1 month ago

And the challenge gets harder as AI systems become more capable and autonomous: the more a system can do without oversight, the more precisely its goals have to match ours.