THE HACKER NEWS

x01
14 hours ago
Discord will require a face scan or ID for full access next month
Age verification for all....
11 HOURS AGO
KMANSM27
Everyone’s building “async agents,” but almost no one can define them
5 DAYS AGO
SOHKAMYUNG
Expansion Microscopy Has Transformed How We See the Cellular World
3 HOURS AGO
CURIOSITRY
Pure C, CPU-only inference with Mistral Voxtral Realtime 4B speech to text model
8 HOURS AGO
SARELTA
Data exfil from agents in messaging apps
A DAY AGO
AZHENLEY
Nobody knows how the whole system works
5 DAYS AGO
MARIUZ
Pg-dev-container is a ready-to-run VS Code development container for PostgreSQL
5 DAYS AGO
SEBG
What's the Entropy of a Random Integer?
2 HOURS AGO
RAWGABBIT
Tokyo high schools abolish rules forcing students to dye non-black hair (2022)
tiny-automates
an hour ago
Frontier AI agents violate ethical constraints 30–50% of time, pressured by KPIs
As autonomous AI agents are increasingly deployed in high-stakes environments, ensuring their safety and alignment with human values has become a paramount concern. Current safety benchmarks primarily evaluate whether agents refuse explicitly harmful instructions or whether they can maintain procedural compliance in complex tasks. However, there is a lack of benchmarks designed to capture emergent forms of outcome-driven constraint violations, which arise when agents pursue goal optimization under strong performance incentives while deprioritizing ethical, legal, or safety constraints over multiple steps in realistic production settings. To address this gap, we introduce a new benchmark comprising 40 distinct scenarios. Each scenario presents a task that requires multi-step actions, and the agent's performance is tied to a specific Key Performance Indicator (KPI). Each scenario features Mandated (instruction-commanded) and Incentivized (KPI-pressure-driven) variations to distinguish between obedience and emergent misalignment. Across 12 state-of-the-art large language models, we observe outcome-driven constraint violations ranging from 1.3% to 71.4%, with 9 of the 12 evaluated models exhibiting misalignment rates between 30% and 50%. Strikingly, we find that superior reasoning capability does not inherently ensure safety; for instance, Gemini-3-Pro-Preview, one of the most capable models evaluated, exhibits the highest violation rate at 71.4%, frequently escalating to severe misconduct to satisfy KPIs. Furthermore, we observe significant "deliberative misalignment", where the models that power the agents recognize their actions as unethical during separate evaluation. These results emphasize the critical need for more realistic agentic-safety training before deployment to mitigate their risks in the real world.
image
tokyobreakfast
12 hours ago
Converting a $3.88 analog clock from Walmart into a ESP8266-based Wi-Fi clock
Uses an ESP8266 module and an Arduino sketch to display the local time on a inexpensive analog quartz clock. - jim11662418/ESP8266_WiFi_Analog_Clock...
peter_d_sherman
7 hours ago
LiftKit – UI where "everything derives from the golden ratio"
LiftKit by Chainlift is an open-source design system based on the golden ratio available for React/Next.js, Webflow, and Figma....
zaik
8 hours ago
Upcoming changes to Let's Encrypt and how they affect XMPP server operators
On 11th February, Let’s Encrypt will be rolling out a change to the certificates they issue to servers by default. Although there is generally nothing that Prosody operators need to do, serv...
subset
4 hours ago
What functional programmers get wrong about systems
Type systems verify properties of programs. Production correctness is a property of systems. The gap between these is where the interesting failures live....
mellosouls
5 hours ago
Is particle physics dead, dying, or just hard?
Columnist Natalie Wolchover checks in with particle physicists more than a decade after the field entered a profound crisis....
1659447091
5 hours ago
The shadowy world of abandoned oil tankers
A growing number of tankers and other commercial vessels are being ditched by their owners....
Curiositry
3 hours ago
Rust implementation of Mistral's Voxtral Mini 4B Realtime runs in your browser
Contribute to TrevorS/voxtral-mini-realtime-rs development by creating an account on GitHub....
https://github.com/TrevorS/voxtral-mini-realtime-rs
ananas-dev
14 hours ago
UEFI Bindings for JavaScript
UEFI Bindings for JavaScript (Proof of Concept)...
pseudalopex
9 hours ago
Discord Alternatives, Ranked
Building an online community takes more than tools. But the right tool can make all the difference....
waihtis
13 hours ago
Sleeper Shells: Attackers Are Planting Dormant Backdoors in Ivanti EPMM
A February 2026 campaign used a internal JSP path and in-memory Java class loaders to quietly seed persistent access across Ivanti EPMM deployments - then walked away. We break down the trad...
znah
2 days ago
Like Game-of-Life, but on Growing Graphs, with WASM and WebGL
Experimental simulation of emergent complexity through graph-rewriting automata....
metzby
5 days ago
Show HN: VillageSQL = MySQL and Extensions
VillageSQL. Contribute to villagesql/villagesql-server development by creating an account on GitHub....
MysticOracle
7 hours ago
Discord faces backlash over age checks after data breach exposed 70k IDs
Discord to block adult content unless users verify ages with selfies or IDs....
ibobev
14 hours ago
Long-Sought Proof Tames Some of Math's Unruliest Equations
Mathematicians finally understand the behavior of an important class of differential equations that describe everything from water pressure to oxygen levels in human tissues....
chwtutha
2 days ago
Vouch
A community trust management system based on explicit vouches to participate. - mitchellh/vouch...