Hacker News
The Hot Mess of AI: How Does Misalignment Scale with Model Intelligence (arxiv.org)
1 point by schmuhblaster 52 minutes ago | past | discuss
GRP-Obliteration: Unaligning LLMs with a Single Unlabeled Prompt [pdf] (arxiv.org)
1 point by janandonly 53 minutes ago | past | discuss
Harmless reward hacks generalize to shutdown evasion and dictatorship in GPT-4.1 (arxiv.org)
1 point by toliveistobuild 6 hours ago | past | 1 comment
FullStack-Agent: Enhancing Agentic Full-Stack Web Coding (arxiv.org)
2 points by simonpure 22 hours ago | past | discuss
Lightweight Memory Construction with Dynamic Evolution for LLM Agents (arxiv.org)
2 points by PaulHoule 23 hours ago | past | discuss
Moltbook: Fast Response or Silence? (arxiv.org)
1 point by EagleEdge 1 day ago | past | discuss
Randomness in Agentic Evals (arxiv.org)
3 points by andre15silva 1 day ago | past | discuss
Large Language Model Reasoning Failures (arxiv.org)
1 point by mpweiher 1 day ago | past | discuss
We Should Separate Memorization from Copyright (arxiv.org)
1 point by 50kIters 1 day ago | past | discuss
Towards a Standard for JSON Document Databases (arxiv.org)
1 point by ingve 1 day ago | past | discuss
Security audit of Browser Use: prompt injection, credential exfil, domain bypass (arxiv.org)
2 points by tiny-automates 1 day ago | past | 1 comment
Frontier AI agents violate ethical constraints 30–50% of time, pressured by KPIs (arxiv.org)
530 points by tiny-automates 1 day ago | past | 354 comments
Simone Weil, André Weil, Bourbaki and Pythagorean Mathematics (arxiv.org)
2 points by bikenaga 1 day ago | past | 1 comment
Large Language Model Reasoning Failures (arxiv.org)
3 points by belter 1 day ago | past | discuss
Shared LoRA Subspaces for Almost Strict Continual Learning (arxiv.org)
1 point by unisub_guy 1 day ago | past | 1 comment
Towards Understanding What State Space Models Learn About Code (arxiv.org)
1 point by belter 1 day ago | past | discuss
Code Formatting Silently Consumes Your LLM Budget (arxiv.org)
1 point by mustaphah 1 day ago | past | discuss
Ernie 5.0 Technical Report (arxiv.org)
2 points by salkahfi 1 day ago | past | discuss
The Case for Contextual Copyleft: Licensing Open Source Training Data and Gener (arxiv.org)
1 point by todsacerdoti 1 day ago | past | discuss
A handy method for hazards detection in an IS of a pipelined processor [pdf] (arxiv.org)
1 point by liungrin 2 days ago | past | discuss
Causal World Modeling for Robot Control (arxiv.org)
1 point by mountainview 2 days ago | past | discuss
VL-JEPA: Joint Embedding Predictive Architecture for Vision-Language (arxiv.org)
2 points by andsoitis 2 days ago | past | discuss
The extent of computation in Malament-Hogarth spacetimes (2006) (arxiv.org)
2 points by gone35 2 days ago | past | discuss
Nonreciprocal wave-mediated interactions power a classical time crystal (arxiv.org)
2 points by rbanffy 2 days ago | past | discuss
Shifts in U.S. Social Media Use, 2020–2024: Decline, Fragmentation, Polarization (2025) (arxiv.org)
212 points by vinnyglennon 2 days ago | past | 207 comments
SOK: On the Analysis of Web Browser Security (2021) (arxiv.org)
1 point by walterbell 2 days ago | past | discuss
Evaluating TCP BBRv2 on the Dropbox edge network (arxiv.org)
1 point by fanf2 2 days ago | past | discuss
Training Foundation Models Directly on Human Brain Data (arxiv.org)
1 point by helloplanets 3 days ago | past | discuss
Open Problems in Mechanistic Interpretability (arxiv.org)
2 points by vinhnx 3 days ago | past | discuss
Psychometric Comparability of LLM-Based Digital Twins (arxiv.org)
1 point by PaulHoule 3 days ago | past | discuss
