digitado

[R] Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis

digitado ⋅ 29 de January de 2026

{“document”:[{“e”:”par”,”c”:[{“e”:”text”,”t”:”Recent advances in reinforcement learning for code generation have made robust environments essential to prevent reward hacking. As LLMs increasingly serve as evaluators in code-based RL, their ability to detect reward hacking remains understudied. In this paper, we propose a novel taxonomy of reward exploits spanning across 54 categories and introduce TRACE (Testing Reward Anomalies in Code Environments), a synthetically curated and human-verified benchmark containing 517 testing trajectories. Unlike prior work that evaluates reward hack detection in isolated […]

Ver mais

Like 0

Liked Liked

technocracy

Enhanced Diffusion Sampling: Efficient Rare Event Sampling and Free Energy Calculation with Diffusion Models

digitado ⋅ 19 de February de 2026

arXiv:2602.16634v1 Announce Type: new Abstract: The rare-event sampling problem has long been the central limiting factor in molecular dynamics (MD), especially in biomolecular simulation. Recently, diffusion models such as BioEmu have emerged as powerful equilibrium samplers that generate independent samples from complex molecular distributions, eliminating the cost of sampling rare transition events. However, a sampling problem remains when computing observables that rely on states which are rare in equilibrium, for example folding free energies. Here, we introduce enhanced […]

Ver mais

Like 0

Liked Liked

technocracy

Why 2026 is the Year Healthcare Finally Hires AI Agents

digitado ⋅ 23 de January de 2026

The medical industry is finally stopping the charade of treating software as a tool and starting to treat AI agents as colleagues. We have moved past the era of passive Large Language Models (LLMs) that act like fancy encyclopedias. The current landscape is defined by agentic AI, digital entities that do not just suggest; they execute. If 2024 was about the “ambient scribe” that sat in the corner recording conversations, 2026 is about the “hireable agent” that navigates […]

Ver mais

Like 0

Liked Liked

technocracy

AR&D: A Framework for Retrieving and Describing Concepts for Interpreting AudioLLMs

digitado ⋅ 27 de February de 2026

arXiv:2602.22253v1 Announce Type: new Abstract: Despite strong performance in audio perception tasks, large audio-language models (AudioLLMs) remain opaque to interpretation. A major factor behind this lack of interpretability is that individual neurons in these models frequently activate in response to several unrelated concepts. We introduce the first mechanistic interpretability framework for AudioLLMs, leveraging sparse autoencoders (SAEs) to disentangle polysemantic activations into monosemantic features. Our pipeline identifies representative audio clips, assigns meaningful names via automated captioning, and validates concepts […]

Ver mais

Like 0

Liked Liked

technocracy

I’ve Seen This IP: A Practical Intersection Attack Against Tor Introduction Circuits and Hidden Services

digitado ⋅ 2 de March de 2026

arXiv:2602.23560v1 Announce Type: new Abstract: Tor onion services rely on long-lived introduction circuits to support anonymous rendezvous between clients and services. Although Tor includes some defenses against traffic analysis, the introduction protocol retains deterministic routing structure that can be leveraged by an adversary. We describe a practical intersection attack on Tor introduction circuits that can, over repeated interactions, identify each hop from the introduction point toward the onion service while requiring observation at only one relay per stage. […]

Ver mais

Like 0

Liked Liked

technocracy

GRAFNet: Multiscale Retinal Processing via Guided Cortical Attention Feedback for Enhancing Medical Image Polyp Segmentation

digitado ⋅ 18 de February de 2026

arXiv:2602.15072v1 Announce Type: new Abstract: Accurate polyp segmentation in colonoscopy is essential for cancer prevention but remains challenging due to: (1) high morphological variability (from flat to protruding lesions), (2) strong visual similarity to normal structures such as folds and vessels, and (3) the need for robust multi-scale detection. Existing deep learning approaches suffer from unidirectional processing, weak multi-scale fusion, and the absence of anatomical constraints, often leading to false positives (over-segmentation of normal structures) and false negatives […]

Ver mais

Like 0

Liked Liked

technocracy

Sunday Scares and Data Leadership: The Pattern That Breaks Us

digitado ⋅ 8 de January de 2026

Sunday panic attacks are the hidden reality for many data leaders. This story explores the “savior complex,” the cycle of burnout, and why we struggle to find rhythm. It challenges the idea that better tools can fix personal patterns and asks the hard question: Can we do transformational work without destroying ourselves? You aren’t broken; you just care too much without boundaries.

Ver mais

Like 0

Liked Liked

technocracy

I Tested a 7B Model That Beat Models 7× Its Size. Here’s What I Found.

digitado ⋅ 15 de January de 2026

Author(s): Adham Khaled Originally published on Towards AI. The Falcon-H1R doesn’t make sense on paper. Until you understand what UAE’s TII actually built. Last Saturday, I downloaded a model that shouldn’t exist. 7 billion parameters. Open-source. From Abu Dhabi. On paper, it’s nothing special. The AI world runs on models 10×, 20×, even 100× this size. Then I ran the benchmarks. AIME-24 mathematics: 88.1%. That’s better than ServiceNow’s Apriel 1.5 — a 15-billion parameter model that scored 86.2%. […]

Ver mais

Like 0

Liked Liked

technocracy

Building a Production Multi-Tenant WhatsApp AI Bot: One Backend, Three Businesses

digitado ⋅ 2 de March de 2026

How I designed a single Python backend that serves a real estate agency in Dubai, a dental practice in Brazil, and a food retailer — each with fully isolated AI behavior, context memory, and business logic — without any off-the-shelf automation tools. Most WhatsApp AI tutorials show you how to build a bot for one use case. You connect to the API, craft a system prompt, call OpenAI, send a reply. It works. But it doesn’t scale. When I started building AI automation […]

Ver mais

Like 0

Liked Liked

technocracy

[D] Monthly Who’s Hiring and Who wants to be Hired?

digitado ⋅ 31 de January de 2026

For Job Postings please use this template Hiring: [Location], Salary:[], [Remote | Relocation], [Full Time | Contract | Part Time] and [Brief overview, what you’re looking for] For Those looking for jobs please use this template Want to be Hired: [Location], Salary Expectation:[], [Remote | Relocation], [Full Time | Contract | Part Time] Resume: [Link to resume] and [Brief overview, what you’re looking for] Please remember that this community is geared towards those with experience. submitted by […]

Ver mais

Like 0

Liked Liked