digitado

datasette-llm 0.1a1

digitado ⋅ 25 de March de 2026

Release: datasette-llm 0.1a1 New release of the base plugin that makes models from LLM available for use by other Datasette plugins such as datasette-enrichments-llm. New register_llm_purposes() plugin hook and get_purposes() function for retrieving registered purpose strings. #1 One of the responsibilities of this plugin is to configure which models are used for which purposes, so you can say in one place “data enrichment uses GPT-5.4-nano but SQL query assistance happens using Sonnet 4.6”, for example. Plugins that depend […]

Ver mais

Like 0

Liked Liked

technocracy

$kappa$-Explorer: A Unified Framework for Active Model Estimation in MDPs

digitado ⋅ 25 de February de 2026

arXiv:2602.20404v1 Announce Type: new Abstract: In tabular Markov decision processes (MDPs) with perfect state observability, each trajectory provides active samples from the transition distributions conditioned on state-action pairs. Consequently, accurate model estimation depends on how the exploration policy allocates visitation frequencies in accordance with the intrinsic complexity of each transition distribution. Building on recent work on coverage-based exploration, we introduce a parameterized family of decomposable and concave objective functions $U_kappa$ that explicitly incorporate both intrinsic estimation complexity and […]

Ver mais

Like 0

Liked Liked

technocracy

Multiagent Reinforcement Learning with Neighbor Action Estimation

digitado ⋅ 8 de January de 2026

Multiagent reinforcement learning, as a prominent intelligent paradigm, enables collaborative decision-making within complex systems. However, existing approaches often rely on explicit action exchange between agents to evaluate action value functions, which is frequently impractical in real-world engineering environments due to communication constraints, latency, energy consumption, and reliability requirements. From an artificial intelligence perspective, this paper proposes an enhanced multiagent reinforcement learning framework that employs action estimation neural networks to infer agent behaviors. By integrating a lightweight action estimation […]

Ver mais

Like 0

Liked Liked

technocracy

Memory Retention Is Not Enough to Master Memory Tasks in Reinforcement Learning

digitado ⋅ 21 de January de 2026

Effective decision-making in the real world depends on memory that is both stable and adaptive: environments change over time, and agents must retain relevant information over long horizons while also updating or overwriting outdated content when circumstances shift. Existing Reinforcement Learning (RL) benchmarks and memory-augmented agents focus primarily on retention, leaving the equally critical ability of memory rewriting largely unexplored. To address this gap, we introduce a benchmark that explicitly tests continual memory updating under partial observability, i.e. […]

Ver mais

Like 0

Liked Liked

technocracy

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

digitado ⋅ 5 de April de 2026

Recent advances in prompt learning allow large language model agents to acquire task-relevant knowledge from inference-time context without parameter changes. For example, existing methods (like ACE or GEPA) can learn system prompts to improve accuracy based on previous agent runs. However, these methods primarily focus on single-agent or low-parallelism settings. This fundamentally limits their ability to efficiently learn from a large set of collected agentic traces. It would be efficient and beneficial to run prompt learning in parallel […]

Ver mais

Like 0

Liked Liked

technocracy

MCLR: Improving Conditional Modeling in Visual Generative Models via Inter-Class Likelihood-Ratio Maximization and Establishing the Equivalence between Classifier-Free Guidance and Alignment Objectives

digitado ⋅ 25 de March de 2026

arXiv:2603.22364v1 Announce Type: new Abstract: Diffusion models have achieved state-of-the-art performance in generative modeling, but their success often relies heavily on classifier-free guidance (CFG), an inference-time heuristic that modifies the sampling trajectory. From a theoretical perspective, diffusion models trained with standard denoising score matching (DSM) are expected to recover the target data distribution, raising the question of why inference-time guidance is necessary in practice. In this work, we ask whether the DSM training objective can be modified in […]

Ver mais

Like 0

Liked Liked

technocracy

Adaptive Sliding Mode Control for Vehicle Platoons with State-Dependent Friction Uncertainty

digitado ⋅ 19 de January de 2026

arXiv:2601.10724v1 Announce Type: new Abstract: Multi-robot formation control has various applications in domains such as vehicle troops, platoons, payload transportation, and surveillance. Maintaining formation in a vehicle platoon requires designing a suitable control scheme that can tackle external disturbances and uncertain system parameters while maintaining a predefined safe distance between the robots. A crucial challenge in this context is dealing with the unknown/uncertain friction forces between wheels and the ground, which vary with changes in road surface, wear […]

Ver mais

Like 0

Liked Liked

technocracy

CROCS: A Two-Stage Clustering Framework for Behaviour-Centric Consumer Segmentation with Smart Meter Data

digitado ⋅ 16 de January de 2026

arXiv:2601.10494v1 Announce Type: new Abstract: With grid operators confronting rising uncertainty from renewable integration and a broader push toward electrification, Demand-Side Management (DSM) — particularly Demand Response (DR) — has attracted significant attention as a cost-effective mechanism for balancing modern electricity systems. Unprecedented volumes of consumption data from a continuing global deployment of smart meters enable consumer segmentation based on real usage behaviours, promising to inform the design of more effective DSM and DR programs. However, existing clustering-based […]

Ver mais

Like 0

Liked Liked

technocracy

Pendle Joins Vietnam IFC Delegation Alongside BlackRock, Morgan Stanley, and Deutsche Bank

digitado ⋅ 31 de March de 2026

Singapore, Singapore, March 31st, 2026/Chainwire/–Pendle announces its CEO TN Lee represented the protocol at a high-level financial delegation in New York alongside representatives from Deutsche Bank, Morgan Stanley, BlackRock, Franklin Templeton, and Anchorage Digital. The group met with Vietnam’s Deputy Prime Minister to build the investment case for Vietnam’s International Financial Center, a landmark initiative positioning Southeast Asia as a destination for global institutional capital. For Pendle, this is more than a diplomatic milestone. It signals the moment […]

Ver mais

Like 0

Liked Liked

technocracy

Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)

digitado ⋅ 20 de March de 2026

arXiv:2509.22459v3 Announce Type: replace Abstract: While achieving exceptional generative quality, modern diffusion, flow, and other matching models suffer from slow inference, as they require many steps of iterative generation. Recent distillation methods address this by training efficient one-step generators under the guidance of a pre-trained teacher model. However, these methods are often constrained to only one specific framework, e.g., only to diffusion or only to flow models. Furthermore, these methods are naturally data-free, and to benefit from the […]

Ver mais

Like 0

Liked Liked