digitado

[P] We added semantic caching to Bifrost and it’s cutting API costs by 60-70%

digitado ⋅ 3 February 2026

We’re building Bifrost, and one feature that’s been really effective is semantic caching. Instead of relying only on exact string matching, we use embeddings to catch when users ask the same thing in different ways. How it works: when a request comes in, we generate an embedding and check whether anything semantically similar already exists in the cache. You can tune the similarity threshold: we default to 0.8, but you can go stricter (0.9+) or looser (0.7) depending on your use […]
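The lookup described above can be sketched roughly as follows. This is a minimal illustration, not Bifrost's actual implementation or API: `SemanticCache`, `toy_embed`, and `cosine` are hypothetical names, and the bag-of-words "embedding" is only a stand-in for a real embedding model. The 0.8 default threshold is the one value taken from the post.

```python
import math
from collections import Counter


def toy_embed(text):
    """Placeholder embedding (assumption): lowercase word counts.

    A real deployment would call an embedding model here instead.
    """
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class SemanticCache:
    """Hypothetical cache that matches prompts by similarity, not equality."""

    def __init__(self, embed=toy_embed, threshold=0.8):
        self.embed = embed
        self.threshold = threshold  # 0.8 is the default mentioned in the post
        self.entries = []  # list of (embedding, cached_response)

    def put(self, prompt, response):
        self.entries.append((self.embed(prompt), response))

    def get(self, prompt):
        """Return the best cached response at or above the threshold, else None."""
        query = self.embed(prompt)
        best_score, best_response = 0.0, None
        for embedding, response in self.entries:
            score = cosine(query, embedding)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None


cache = SemanticCache()
cache.put("what is the capital of france", "Paris")
hit = cache.get("what is the capital of france please")  # reworded prompt
miss = cache.get("how do i reset my password")           # unrelated prompt
```

With the toy embedding, the reworded prompt scores about 0.93 against the stored one, so it is a hit at the 0.8 default but would be a miss with a stricter threshold such as 0.95. The linear scan is only for clarity; a production cache would more plausibly use a vector index.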

technocracy

Causal Inference on Stopped Random Walks in Online Advertising

digitado ⋅ 6 February 2026

arXiv:2602.05997v1. Abstract: We consider a causal inference problem frequently encountered in online advertising systems, where a publisher (e.g., Instagram, TikTok) interacts repeatedly with human users and advertisers by sporadically displaying to each user an advertisement selected through an auction. Each treatment corresponds to a parameter value of the advertising mechanism (e.g., auction reserve-price), and we want to estimate through experiments the corresponding long-term treatment effect (e.g., annual advertising revenue). In our setting, the treatment affects […]

technocracy

Discrete Adjoint Matching

digitado ⋅ 10 February 2026

arXiv:2602.07132v1. Abstract: Computation methods for solving entropy-regularized reward optimization, a class of problems widely used for fine-tuning generative models, have advanced rapidly. Among those, Adjoint Matching (AM, Domingo-Enrich et al., 2025) has proven highly effective in continuous state spaces with differentiable rewards. Transferring these practical successes to discrete generative modeling, however, remains particularly challenging and largely unexplored, mainly due to the drastic shift in generative model classes to discrete state spaces, which are […]

technocracy

Endless Terminals: Scaling RL Environments for Terminal Agents

digitado ⋅ 26 January 2026

arXiv:2601.16443v1. Abstract: Environments are the bottleneck for self-improving agents. Current terminal benchmarks were built for evaluation, not training; reinforcement learning requires a scalable pipeline, not just a dataset. We introduce Endless Terminals, a fully autonomous pipeline that procedurally generates terminal-use tasks without human annotation. The pipeline has four stages: generating diverse task descriptions, building and validating containerized environments, producing completion tests, and filtering for solvability. From this pipeline we obtain 3255 tasks spanning file operations, […]

technocracy

“DECEPTICON: How Dark Patterns Manipulate Web Agents”, Cuvin et al 2025

digitado ⋅ 10 February 2026

Submitted by /u/gwern

technocracy

Magnetic Field-Mediated Superconducting Logic

digitado ⋅ 10 February 2026

arXiv:2602.07146v1. Abstract: While superconductors are highly attractive for energy-efficient computing, fundamental limitations in their logic circuit integration have hindered scaling and led to increased energy consumption. We therefore propose and experimentally demonstrate a novel superconducting switching device utilizing the proximity magnetization from a spin-orbit torque-switched magnet to control the resistivity of a superconductor. We further propose a complete logic family comprised solely of these devices. This novel implementation has the potential to drastically outperform existing […]

technocracy

Elsewise: Authoring AI-Based Interactive Narrative with Possibility Space Visualization

digitado ⋅ 23 January 2026

arXiv:2601.15295v1. Abstract: Interactive narrative (IN) authors craft spaces of divergent narrative possibilities for players to explore, with the player’s input determining which narrative possibilities they actually experience. Generative AI can enable new forms of IN by improvisationally expanding on pre-authored content in response to open-ended player input. However, this extrapolation risks widening the gap between author-envisioned and player-experienced stories, potentially limiting the strength of plot progression and the communication of the author’s narrative intent. To […]

technocracy

From Gemma 3 270M to FunctionGemma, How Google AI Built a Compact Function Calling Specialist for Edge Workloads

digitado ⋅ 27 December 2025

Google has released FunctionGemma, a specialized version of the Gemma 3 270M model that is trained specifically for function calling and designed to run as an edge agent that maps natural language to executable API actions. What is FunctionGemma? FunctionGemma is a 270M-parameter, text-only transformer based on Gemma 3 270M. It keeps the same architecture as Gemma 3 and is released as an open model under the Gemma license, but the training objective and chat […]

technocracy

Conservative lawmakers want porn taxes. Critics say they’re unconstitutional.

digitado ⋅ 10 January 2026

As age-verification laws continue to dismantle the adult industry—and determine the future of free speech on the internet—a Utah lawmaker proposed a bill this week that would enforce a tax on porn sites that operate within the state. Introduced by state senator Calvin Musselman, a Republican, the bill would impose a 7 percent tax on total receipts “from sales, distributions, memberships, subscriptions, performances, and content amounting to material harmful to minors that is produced, sold, filmed, generated, or […]

technocracy

Anytime Optimal Decision Tree Learning with Continuous Features

digitado ⋅ 21 January 2026

In recent years, significant progress has been made on algorithms for learning optimal decision trees, primarily in the context of binary features. Extending these methods to continuous features remains substantially more challenging due to the large number of potential splits for each feature. Recently, an elegant exact algorithm was proposed for learning optimal decision trees with continuous features; however, the rapidly increasing computational time limits its practical applicability to shallow depths (typically 3 or 4). It relies on […]
