digitado

Training large language models more efficiently

digitado ⋅ 27 de March de 2025

Training large language models more efficiently Training separate models on different datasets and then merging them reduces computational costs by as much as 91%. Conversational AI Dhananjay Ram Nikolaos Pappas March 27, 01:10 PM March 27, 01:10 PM Large language models (LLMs) go through several stages of training on mixed datasets with different distributions, stages that include pretraining, instruction tuning, and reinforcement learning from human feedback. Finding the optimal mix of data distributions across datasets is essential to […]

Ver mais

Like 0

Liked Liked

technocracy

Residual Cross-Modal Fusion Networks for Audio-Visual Navigation

digitado ⋅ 15 de January de 2026

arXiv:2601.08868v1 Announce Type: new Abstract: Audio-visual embodied navigation aims to enable an agent to autonomously localize and reach a sound source in unseen 3D environments by leveraging auditory cues. The key challenge of this task lies in effectively modeling the interaction between heterogeneous features during multimodal fusion, so as to avoid single-modality dominance or information degradation, particularly in cross-domain scenarios. To address this, we propose a Cross-Modal Residual Fusion Network, which introduces bidirectional residual interactions between audio and […]

Ver mais

Like 0

Liked Liked

technocracy

Explainable AI needs formalization

digitado ⋅ 12 de January de 2026

arXiv:2409.14590v4 Announce Type: replace-cross Abstract: The field of “explainable artificial intelligence” (XAI) seemingly addresses the desire that decisions of machine learning systems should be human-understandable. However, in its current state, XAI itself needs scrutiny. Popular methods cannot reliably answer relevant questions about ML models, their training data, or test inputs, because they systematically attribute importance to input features that are independent of the prediction target. This limits the utility of XAI for diagnosing and correcting data and models, […]

Ver mais

Like 0

Liked Liked

technocracy

Structured Prediction Cascades

digitado ⋅ 31 de March de 2010

Structured prediction tasks pose a fundamental trade-off between the need for model complexity to increase predictive power and the limited computational resources for inference in the exponentially-sized output spaces such models require. We formulate and develop structured prediction cascades: a sequence of increasingly complex models that progressively filter the space of possible outputs. We represent an exponentially large set of filtered outputs using max marginals and propose a novel convex loss function that balances filtering error with filtering […]

Ver mais

Like 0

Liked Liked

technocracy

Godot 4.4: Metal Rendering Backend, 3D Physics Interpolation, and More

digitado ⋅ 31 de January de 2026

This is the first dev snapshot for 4.4! During the beta and release candidate stages of 4.3, we accumulated a lot of PRs that were of great quality, but deemed too risky to include in 4.3. We have begun merging those PRs now and have quickly gathered a lot of changes that warrant a dev release! Many of the changes in this release are bug fixes that will be backported to Godot 4.3 and released in 4.3.1! So […]

Ver mais

Like 0

Liked Liked

technocracy

Limits of Self-Correction in LLMs: An Information-Theoretic Analysis of Correlated Errors

digitado ⋅ 13 de January de 2026

Recent empirical work shows that large language models struggle to self-correct reasoning without external feedback. We propose a possible explanation: correlated error between generator and evaluator. When both components share failure modes, self-evaluation may provide weak evidence of correctness, and repeated self-critique may amplify confidence without adding information. We formalize this with two information-theoretic bounds. We then describe a practical architecture pairing high-entropy proposal generation with low-entropy external selection. This suggests an alternative to extended chain-of-thought in a […]

Ver mais

Like 0

Liked Liked

technocracy

Simulation-based Bayesian inference with ameliorative learned summary statistics — Part I

digitado ⋅ 2 de February de 2026

arXiv:2601.22441v1 Announce Type: new Abstract: This paper, which is Part 1 of a two-part paper series, considers a simulation-based inference with learned summary statistics, in which such a learned summary statistic serves as an empirical-likelihood with ameliorative effects in the Bayesian setting, when the exact likelihood function associated with the observation data and the simulation model is difficult to obtain in a closed form or computationally intractable. In particular, a transformation technique which leverages the Cressie-Read discrepancy criterion […]

Ver mais

Like 0

Liked Liked

technocracy

Not All Negative Samples Are Equal: LLMs Learn Better from Plausible Reasoning

digitado ⋅ 3 de February de 2026

Learning from negative samples holds great promise for improving Large Language Model (LLM) reasoning capability, yet existing methods treat all incorrect responses as equally informative, overlooking the crucial role of sample quality. To address this, we propose Plausible Negative Samples (PNS), a method that synthesizes high-quality negative samples exhibiting expected format and structural coherence while ultimately yielding incorrect answers. PNS trains a dedicated model via reverse reinforcement learning (RL) guided by a composite reward combining format compliance, accuracy […]

Ver mais

Like 0

Liked Liked

technocracy

Learning Personalized Agents from Human Feedback

digitado ⋅ 18 de February de 2026

Modern AI agents are powerful but often fail to align with the idiosyncratic, evolving preferences of individual users. Prior approaches typically rely on static datasets, either training implicit preference models on interaction history or encoding user profiles in external memory. However, these approaches struggle with new users and with preferences that change over time. We introduce Personalized Agents from Human Feedback (PAHF), a framework for continual personalization in which agents learn online from live interaction using explicit per-user […]

Ver mais

Like 0

Liked Liked

technocracy

Out-of-distribution generalization of deep-learning surrogates for 2D PDE-generated dynamics in the small-data regime

digitado ⋅ 13 de January de 2026

Partial differential equations (PDEs) are a central tool for modeling the dynamics of physical, engineering, and materials systems, but high-fidelity simulations are often computationally expensive. At the same time, many scientific applications can be viewed as the evolution of spatially distributed fields, making data-driven forecasting of such fields a core task in scientific machine learning. In this work we study autoregressive deep-learning surrogates for two-dimensional PDE dynamics on periodic domains, focusing on generalization to out-of-distribution initial conditions within […]

Ver mais

Like 0

Liked Liked