Alignment faking in large language models

Article URL: https://www.anthropic.com/research/alignment-faking

Comments URL: https://news.ycombinator.com/item?id=42458752

Points: 63

# Comments: 35

https://www.anthropic.com/research/alignment-faking

Creată 1mo | 19 dec. 2024, 08:10:05

Autentifică-te pentru a adăuga comentarii

Alte posturi din acest grup

File Explorer is merged to Helix editor

File Explorer is merged to Helix editor

Article URL: https://github.com/helix-editor/helix/pull/11285

Comments URL:

25 ian. 2025, 03:30:13 | Hacker news

Android SMS Gateway Using MQTT

Android SMS Gateway Using MQTT

Article URL: https://github.com/ibnux/Android-SMS-Gateway-MQTT

Comments URL:

25 ian. 2025, 03:30:12 | Hacker news

Show HN: I recovered one of my earliest ZX-Spectrum games from an audio cassette

Show HN: I recovered one of my earliest ZX-Spectrum games from an audio cassette

Recently, I managed to recover some of my earliest work from the ZX Spectrum era from an old audio cassette.

It is a mini-game, that I created as a teen, called "Atomix," written in BASIC with a

25 ian. 2025, 03:30:10 | Hacker news

Frustrated YouTube viewers seek explanation for hour-long unskippable ads

Frustrated YouTube viewers seek explanation for hour-long unskippable ads

Article URL: https://www.androidauthority.com/youtube-long-unskippable-ads-problem-3519957/

Comm

25 ian. 2025, 03:30:10 | Hacker news

Caltrain's Electric Fleet More Efficient Than Expected

Caltrain's Electric Fleet More Efficient Than Expected

Article URL: https://www.caltrain.com/news/caltrains-electric-fleet-more-efficient-expected

Comm

25 ian. 2025, 03:30:09 | Hacker news

Kidnappers sever finger of cryptocurrency millionaire David Balland

Kidnappers sever finger of cryptocurrency millionaire David Balland

Article URL: https://moneycheck.com/french-police-free-kidnapped-ledger-executive-after-day-lo

25 ian. 2025, 03:30:08 | Hacker news

Nuclear Proliferation and the "Nth Country Experiment"

Nuclear Proliferation and the "Nth Country Experiment"

Article URL: https://nsarchive.gwu.edu/briefing-book/nuclear-vault/2025-

25 ian. 2025, 01:20:06 | Hacker news

Techie