digitado

KernelBlaster: Continual Cross-Task CUDA Optimization via Memory-Augmented In-Context Reinforcement Learning

digitado ⋅ 15 de February de 2026

Optimizing CUDA code across multiple generations of GPU architectures is challenging, as achieving peak performance requires an extensive exploration of an increasingly complex, hardware-specific optimization space. Traditional compilers are constrained by fixed heuristics, whereas finetuning Large Language Models (LLMs) can be expensive. However, agentic workflows for CUDA code optimization have limited ability to aggregate knowledge from prior exploration, leading to biased sampling and suboptimal solutions. We propose KernelBlaster, a Memory-Augmented In-context Reinforcement Learning (MAIC-RL) framework designed to improve […]

Ver mais

Like 0

Liked Liked

technocracy

A Practical Guide Towards Interpreting Time-Series Deep Clinical Predictive Models: A Reproducibility Study

digitado ⋅ 27 de March de 2026

arXiv:2603.24828v1 Announce Type: new Abstract: Clinical decisions are high-stakes and require explicit justification, making model interpretability essential for auditing deep clinical models prior to deployment. As the ecosystem of model architectures and explainability methods expands, critical questions remain: Do architectural features like attention improve explainability? Do interpretability approaches generalize across clinical tasks? While prior benchmarking efforts exist, they often lack extensibility and reproducibility, and critically, fail to systematically examine how interpretability varies across the interplay of clinical tasks […]

Ver mais

Like 0

Liked Liked

technocracy

Sampling Parallelism for Fast and Efficient Bayesian Learning

digitado ⋅ 6 de April de 2026

Machine learning models, and deep neural networks in particular, are increasingly deployed in risk-sensitive domains such as healthcare, environmental forecasting, and finance, where reliable quantification of predictive uncertainty is essential. However, many uncertainty quantification (UQ) methods remain difficult to apply due to their substantial computational cost. Sampling-based Bayesian learning approaches, such as Bayesian neural networks (BNNs), are particularly expensive since drawing and evaluating multiple parameter samples rapidly exhausts memory and compute resources. These constraints have limited the accessibility […]

Ver mais

Like 0

Liked Liked

technocracy

A New Convergence Analysis of Plug-and-Play Proximal Gradient Descent Under Prior Mismatch

digitado ⋅ 16 de January de 2026

arXiv:2601.09831v1 Announce Type: new Abstract: In this work, we provide a new convergence theory for plug-and-play proximal gradient descent (PnP-PGD) under prior mismatch where the denoiser is trained on a different data distribution to the inference task at hand. To the best of our knowledge, this is the first convergence proof of PnP-PGD under prior mismatch. Compared with the existing theoretical results for PnP algorithms, our new results removed the need for several restrictive and unverifiable assumptions.

Ver mais

Like 0

Liked Liked

technocracy

Do It for HER: First-Order Temporal Logic Reward Specification in Reinforcement Learning (Extended Version)

digitado ⋅ 5 de February de 2026

In this work, we propose a novel framework for the logical specification of non-Markovian rewards in Markov Decision Processes (MDPs) with large state spaces. Our approach leverages Linear Temporal Logic Modulo Theories over finite traces (LTLfMT), a more expressive extension of classical temporal logic in which predicates are first-order formulas of arbitrary first-order theories rather than simple Boolean variables. This enhanced expressiveness enables the specification of complex tasks over unstructured and heterogeneous data domains, promoting a unified and […]

Ver mais

Like 0

Liked Liked

technocracy

Summer Street

digitado ⋅ 29 de May de 2026

:::info Astounding Stories of Super-Science May 2001, by Astounding Stories is part of HackerNoon’s Book Blog Post series. You can jump to any chapter in this book here. A ROOM WITH A VIEW – Chapter XII – Twelfth Chapter Astounding Stories of Super-Science May 2001: A ROOM WITH A VIEW – Chapter XII – Twelfth Chapter By E. M. Forster ::: It was a Saturday afternoon, gay and brilliant after abundant rains, and the spirit of youth dwelt in […]

Ver mais

Like 0

Liked Liked

technocracy

Conformal Prediction for Nonparametric Instrumental Regression

digitado ⋅ 27 de March de 2026

arXiv:2603.25509v1 Announce Type: cross Abstract: We propose a method for constructing distribution-free prediction intervals in nonparametric instrumental variable regression (NPIV), with finite-sample coverage guarantees. Building on the conditional guarantee framework in conformal inference, we reformulate conditional coverage as marginal coverage over a class of IV shifts $mathcal{F}$. Our method can be combined with any NPIV estimator, including sieve 2SLS and other machine-learning-based NPIV methods such as neural networks minimax approaches. Our theoretical analysis establishes distribution-free, finite-sample coverage over […]

Ver mais

Like 0

Liked Liked

technocracy

OceanBase Bacchus: a High-Performance and Scalable Cloud-Native Shared Storage Architecture for Multi-Cloud

digitado ⋅ 2 de March de 2026

arXiv:2602.23571v1 Announce Type: new Abstract: Although an increasing number of databases now embrace shared-storage architectures, current storage-disaggregated systems have yet to strike an optimal balance between cost and performance. In high-concurrency read/write scenarios, B+-tree-based shared storage struggles to efficiently absorb frequent in-place updates. Existing LSM-tree-backed disaggregated storage designs are hindered by the intricate implementation of cross-node shared-log mechanisms, where no satisfactory solution yet exists. This paper presents OceanBase Bacchus, an LSM-tree architecture tailored for object storage provided by […]

Ver mais

Like 0

Liked Liked

technocracy

Reinforcement Learning Handbook

digitado ⋅ 4 de June de 2026

Hey all, I’ve been building an open RL Handbook as a comprehensive guide for reinforcement learning. Hope you will find it useful 🌐 rl-handbook.com 💻 github.com/lubludrova/rl-handbook Feedback, contribution or GitHub star ⭐ are welcome! submitted by /u/Savings-Shoulder-976 [link] [comments]

Ver mais

Like 0

Liked Liked

technocracy

Verifying Good Regulator Conditions for Hypergraph Observers: Natural Gradient Learning from Causal Invariance via Established Theorems

digitado ⋅ 11 de March de 2026

arXiv:2603.09067v1 Announce Type: new Abstract: We verify that persistent observers in causally invariant hypergraph substrates satisfy the conditions of the Conant-Ashby Good Regulator Theorem. Building on Wolfram’s hypergraph physics and Vanchurin’s neural network cosmology, we formalize persistent observers as entities that minimize prediction error at their boundary with the environment. Applying a modern reformulation of the Conant-Ashby theorem, we demonstrate that hypergraph observers satisfy Good Regulator conditions, requiring them to maintain internal models. Once an internal model with […]

Ver mais

Like 0

Liked Liked