digitado

Intersection of RL and Psychology

digitado ⋅ 5 May 2026

Looking for others interested in both psychology and RL. I've been working on what was meant to be a basic human model, and it turned into what could be a better understanding of humans in general. Please let me know what you think: https://narquie.substack.com/p/modeling-a-human-through-reinforcement (submitted by /u/EdgarKafka)

technocracy

Kakao Mobility details Level 4 autonomous driving roadmap for physical AI

digitado ⋅ 28 April 2026

Kakao Mobility has set out plans to develop Level 4 autonomous driving technologies in-house as part of its physical AI strategy. Kim Jin-kyu, vice president and head of Kakao Mobility’s Physical AI division, presented the roadmap at the 2026 World IT Show conference at COEX in Seoul. His session focused on autonomous driving services built around mobility platforms in the physical AI era. The event was held under the title “Beyond Idea, Into Action: AI moves Reality,” with […]

technocracy

Online Learning for Uninformed Markov Games: Empirical Nash-Value Regret and Non-Stationarity Adaptation

digitado ⋅ 6 February 2026

We study online learning in two-player uninformed Markov games, where the opponent’s actions and policies are unobserved. In this setting, Tian et al. (2021) show that achieving no-external-regret is impossible without incurring an exponential dependence on the episode length $H$. They then turn to the weaker notion of Nash-value regret and propose a V-learning algorithm with regret $O(K^{2/3})$ after $K$ episodes. However, their algorithm and guarantee do not adapt to the difficulty of the problem: even in the […]

technocracy

Symmetric Aggregation of Conformity Scores for Efficient Uncertainty Sets

digitado ⋅ 6 March 2026

arXiv:2512.06945v2 (replacement). Abstract: Access to multiple predictive models trained for the same task, whether in regression or classification, is increasingly common in many applications. Aggregating their predictive uncertainties to produce reliable and efficient uncertainty quantification is therefore a critical but still underexplored challenge, especially within the framework of conformal prediction (CP). While CP methods can generate individual prediction sets from each model, combining them into a single, more informative set remains a challenging problem. To address […]

technocracy

Learning Complex Physical Regimes via Coverage-oriented Uncertainty Quantification: An application to the Critical Heat Flux

digitado ⋅ 25 February 2026

A central challenge in scientific machine learning (ML) is the correct representation of physical systems governed by multi-regime behaviours. In these scenarios, standard data analysis techniques often fail to capture the nature of the data, as the system’s response varies significantly across the state space due to its stochasticity and the different physical regimes. Uncertainty quantification (UQ) should thus not be viewed merely as a safety assessment, but as a support to the learning task itself, guiding the […]

technocracy

Surface-Constrained Offline Warping with Contact-Aware Online Pose Projection for Safe Robotic Trajectory Execution

digitado ⋅ 31 March 2026

arXiv:2603.26711v1 (new). Abstract: Robotic manipulation tasks that require repeated tool motion along curved surfaces frequently arise in surface finishing, inspection, and guided interaction. In practice, nominal motion primitives are often designed independently of the deployment surface and later reused across varying geometries. Directly tiling such primitives onto nonplanar surfaces introduces geometric inconsistencies, leading to interpenetration, orientation discontinuities, and cumulative drift over repeated cycles. We present a two-stage framework that separates geometric embedding from execution-level regulation. An […]

technocracy

Modeling and Control for UAV with Off-center Slung Load

digitado ⋅ 8 January 2026

arXiv:2601.03386v1 (new). Abstract: An unmanned aerial vehicle (UAV) with a slung load is a classic air transportation system. In practical applications, the suspension point of the slung load does not always align with the center of mass (CoM) of the UAV due to mission requirements or mechanical interference. This offset creates coupling in the system’s nonlinear dynamics, which leads to a complicated motion control problem. In existing research, modeling of the system is performed about the UAV’s […]

technocracy

llm-all-models-async 0.1

digitado ⋅ 31 March 2026

Release: llm-all-models-async 0.1. LLM plugins can define new models in both sync and async varieties. The async variants are most common for API-backed models, while sync variants tend to be things that run the model directly within the plugin. My llm-mrchatterbox plugin is sync only. I wanted to try it out with various Datasette LLM features (specifically datasette-enrichments-llm) but Datasette can only use async models. So… I had Claude spin up this plugin that turns sync models into […]
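The excerpt is cut off before it says how the plugin does the conversion, so the following is only a generic sketch of one common way to expose a blocking call through an async interface in Python: running it in a worker thread with `asyncio.to_thread`. The names `sync_prompt` and `async_prompt` are hypothetical stand-ins and are not part of the LLM or Datasette APIs.

```python
import asyncio
import time


def sync_prompt(text: str) -> str:
    """Hypothetical blocking model call, standing in for a sync-only model."""
    time.sleep(0.01)  # simulate work that would block the event loop
    return text.upper()


async def async_prompt(text: str) -> str:
    """Async wrapper: run the blocking call in a worker thread.

    This is an assumed approach for illustration, not necessarily
    what llm-all-models-async does internally.
    """
    return await asyncio.to_thread(sync_prompt, text)


async def main() -> list[str]:
    # Both calls run concurrently in separate threads; gather keeps input order.
    return await asyncio.gather(async_prompt("hello"), async_prompt("world"))


if __name__ == "__main__":
    print(asyncio.run(main()))
```

Threads suit this case because the wrapped function is assumed to be blocking and not awaitable; the event loop stays responsive while the call runs elsewhere.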

technocracy

Anthropic calls out China’s AI copycats

digitado ⋅ 24 February 2026

China’s AI rise has been one of the most impressive stories in tech, but Anthropic says it had help from an unlikely source: Claude itself. The company just revealed that Chinese labs DeepSeek, MiniMax, and Moonshot ran a combined 16M exchanges across thousands of fake accounts to clone Claude’s capabilities into their own models, a scheme it says demands industry-wide action. In today’s […]

technocracy

Regularity of Solutions to Beckmann’s Parametric Optimal Transport

digitado ⋅ 23 March 2026

arXiv:2603.19755v1 (cross-list). Abstract: Beckmann’s problem in optimal transport minimizes the total squared flux in a continuous transport problem from a source to a target distribution. In this article, the regularity theory for solutions to Beckmann’s problem in optimal transport is developed utilizing an unconstrained Lagrangian formulation and solving the variational first order optimality conditions. It turns out that the Lagrangian multiplier that enforces Beckmann’s divergence constraint fulfills a Poisson equation and the flux vector field is […]
