digitado

The DeepXube Software Package for Solving Pathfinding Problems with Learned Heuristic Functions and Search

digitado ⋅ 25 de March de 2026

DeepXube is a free and open-source Python package and command-line tool that seeks to automate the solution of pathfinding problems by using machine learning to learn heuristic functions that guide heuristic search algorithms tailored to deep neural networks (DNNs). DeepXube is comprised of the latest advances in deep reinforcement learning, heuristic search, and formal logic for solving pathfinding problems. This includes limited-horizon Bellman-based learning, hindsight experience replay, batched heuristic search, and specifying goals with answer-set programming. A robust […]

Ver mais

Like 0

Liked Liked

technocracy

Statistical-Neural Interaction Networks for Interpretable Mixed-Type Data Imputation

digitado ⋅ 21 de January de 2026

arXiv:2601.12380v1 Announce Type: cross Abstract: Real-world tabular databases routinely combine continuous measurements and categorical records, yet missing entries are pervasive and can distort downstream analysis. We propose Statistical-Neural Interaction (SNI), an interpretable mixed-type imputation framework that couples correlation-derived statistical priors with neural feature attention through a Controllable-Prior Feature Attention (CPFA) module. CPFA learns head-wise prior-strength coefficients ${lambda_h}$ that softly regularize attention toward the prior while allowing data-driven deviations when nonlinear patterns appear to be present in the data. […]

Ver mais

Like 0

Liked Liked

technocracy

Space-Efficient Text Indexing with Mismatches using Function Inversion

digitado ⋅ 3 de April de 2026

arXiv:2604.01307v1 Announce Type: new Abstract: A classic data structure problem is to preprocess a string T of length $n$ so that, given a query $q$, we can quickly find all substrings of T with Hamming distance at most $k$ from the query string. Variants of this problem have seen significant research both in theory and in practice. For a wide parameter range, the best worst-case bounds are achieved by the “CGL tree” (Cole, Gottlieb, Lewenstein 2004), which achieves […]

Ver mais

Like 0

Liked Liked

technocracy

Low-Rank Online Dynamic Assortment with Dual Contextual Information

digitado ⋅ 16 de February de 2026

arXiv:2404.17592v3 Announce Type: replace-cross Abstract: As e-commerce expands, delivering real-time personalized recommendations from vast catalogs poses a critical challenge for retail platforms. Maximizing revenue requires careful consideration of both individual customer characteristics and available item features to continuously optimize assortments over time. In this paper, we consider the dynamic assortment problem with dual contexts — user and item features. In high-dimensional scenarios, the quadratic growth of dimensions complicates computation and estimation. To tackle this challenge, we introduce a […]

Ver mais

Like 0

Liked Liked

technocracy

¿Guerras cada vez más algorítmicas?

digitado ⋅ 3 de March de 2026

Durante años hemos hablado de la inteligencia artificial como una herramienta para optimizar procesos empresariales, personalizar publicidad o escribir correos más rápidos. Pero los acontecimientos recientes en Venezuela e Irán apuntan a otra dirección mucho menos cómoda: la progresiva integración de modelos comerciales de inteligencia artificial en operaciones militares reales. No como un complemento anecdótico, sino como parte del sistema nervioso de la decisión. Un artículo reciente del Wall Street Journal, titulado «U.S. strikes in Middle East use […]

Ver mais

Like 0

Liked Liked

technocracy

Do More Predictions Improve Statistical Inference? Filtered Prediction-Powered Inference

digitado ⋅ 12 de February de 2026

arXiv:2602.10464v1 Announce Type: cross Abstract: Recent advances in artificial intelligence have enabled the generation of large-scale, low-cost predictions with increasingly high fidelity. As a result, the primary challenge in statistical inference has shifted from data scarcity to data reliability. Prediction-powered inference methods seek to exploit such predictions to improve efficiency when labeled data are limited. However, existing approaches implicitly adopt a use-all philosophy, under which incorporating more predictions is presumed to improve inference. When prediction quality is heterogeneous, […]

Ver mais

Like 0

Liked Liked

technocracy

Non-Convex Portfolio Optimization via Energy-Based Models: A Comparative Analysis Using the Thermodynamic HypergRaphical Model Library (THRML) for Index Tracking

digitado ⋅ 13 de January de 2026

arXiv:2601.07792v1 Announce Type: cross Abstract: Portfolio optimization under cardinality constraints transforms the classical Markowitz mean-variance problem from a convex quadratic problem into an NP-hard combinatorial optimization problem. This paper introduces a novel approach using THRML (Thermodynamic HypergRaphical Model Library), a JAX-based library for building and sampling probabilistic graphical models that reformulates index tracking as probabilistic inference on an Ising Hamiltonian. Unlike traditional methods that seek a single optimal solution, THRML samples from the Boltzmann distribution of high-quality portfolios […]

Ver mais

Like 0

Liked Liked

technocracy

Time-Aware Synthetic Control

digitado ⋅ 7 de January de 2026

arXiv:2601.03099v1 Announce Type: cross Abstract: The synthetic control (SC) framework is widely used for observational causal inference with time-series panel data. SC has been successful in diverse applications, but existing methods typically treat the ordering of pre-intervention time indices interchangeable. This invariance means they may not fully take advantage of temporal structure when strong trends are present. We propose Time-Aware Synthetic Control (TASC), which employs a state-space model with a constant trend while preserving a low-rank structure of […]

Ver mais

Like 0

Liked Liked

technocracy

VLM-UQBench: A Benchmark for Modality-Specific and Cross-Modality Uncertainties in Vision Language Models

digitado ⋅ 11 de February de 2026

arXiv:2602.09214v1 Announce Type: new Abstract: Uncertainty quantification (UQ) is vital for ensuring that vision-language models (VLMs) behave safely and reliably. A central challenge is to localize uncertainty to its source, determining whether it arises from the image, the text, or misalignment between the two. We introduce VLM-UQBench, a benchmark for modality-specific and cross-modal data uncertainty in VLMs, It consists of 600 real-world samples drawn from the VizWiz dataset, curated into clean, image-, text-, and cross-modal uncertainty subsets, and […]

Ver mais

Like 0

Liked Liked

technocracy

Matching Rates and Optimal Allocation for Federated Probe-Logit Distillation under Heterogeneous Bandwidth Budgets

digitado ⋅ 29 de May de 2026

arXiv:2605.29642v1 Announce Type: new Abstract: In federated language modeling, $K$ nodes each hold $n$ samples but cannot pool data or exchange full-precision gradients or weights. We study the minimax rate at which a conditional distribution over $V$ tokens can be estimated when each node may upload at most $B$ bits per query in a public probe set. In federated probe-logit distillation (FPLD), each node transmits a scalar-quantized logit vector on the probe set, and an aggregator distills a […]

Ver mais

Like 0

Liked Liked