Compressed Sensing for Capability Localization in Large Language Models
arXiv:2603.03335v1

Abstract: Large language models (LLMs) exhibit a wide range of capabilities, including mathematical reasoning, code generation, and linguistic behaviors. We show that many of these capabilities are highly localized to small subsets of attention heads within Transformer architectures. Zeroing out as few as five task-specific heads can degrade performance by up to $65\%$ on standard benchmarks measuring the capability of interest, while largely preserving performance on unrelated tasks. We introduce a compressed-sensing-based method that […]
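To make the head-ablation experiment concrete, here is a minimal sketch of zeroing out a handful of attention heads and comparing language-modeling loss before and after. It is not the paper's released code: it assumes a Hugging Face causal LM whose forward pass accepts a head_mask, and the model name, head indices, and probe prompt are illustrative placeholders.

# Minimal sketch (illustrative, not the paper's method): zero out a small set of
# attention heads via head_mask and compare next-token loss before and after.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

model_name = "gpt2"  # assumption: any HF causal LM that supports head_mask works similarly
tokenizer = GPT2Tokenizer.from_pretrained(model_name)
model = GPT2LMHeadModel.from_pretrained(model_name).eval()

# Hypothetical set of (layer, head) pairs to ablate, standing in for "task-specific" heads.
heads_to_zero = [(3, 1), (5, 7), (8, 0), (9, 11), (10, 4)]

n_layers = model.config.n_layer
n_heads = model.config.n_head
head_mask = torch.ones(n_layers, n_heads)
for layer, head in heads_to_zero:
    head_mask[layer, head] = 0.0  # remove this head's contribution entirely

prompt = "12 + 29 ="  # toy probe of the capability of interest (here, arithmetic)
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    baseline = model(**inputs, labels=inputs["input_ids"]).loss
    ablated = model(**inputs, labels=inputs["input_ids"], head_mask=head_mask).loss

print(f"baseline loss: {baseline.item():.3f}, ablated loss: {ablated.item():.3f}")

In practice one would score full benchmarks rather than a single prompt, but the same masking mechanism lets the ablated and unrelated tasks be evaluated with an identical forward pass.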