digitado

Can Large Language Models Derive New Knowledge? A Dynamic Benchmark for Biological Knowledge Discovery

digitado ⋅ March 5, 2026

arXiv:2603.03322v1. Recent advancements in Large Language Model (LLM) agents have demonstrated remarkable potential in automatic knowledge discovery. However, rigorously evaluating an AI’s capacity for knowledge discovery remains a critical challenge. Existing benchmarks predominantly rely on static datasets, leading to inevitable data contamination where models have likely seen the evaluation knowledge during training. Furthermore, the rapid release cycles of modern LLMs render static benchmarks quickly outdated, failing to assess the ability to discover truly new […]


technocracy

Active Causal Experimentalist (ACE): Learning Intervention Strategies via Direct Preference Optimization

digitado ⋅ February 2, 2026

Discovering causal relationships requires controlled experiments, but experimentalists face a sequential decision problem: each intervention reveals information that should inform what to try next. Traditional approaches such as random sampling, greedy information maximization, and round-robin coverage treat each decision in isolation, unable to learn adaptive strategies from experience. We propose Active Causal Experimentalist (ACE), which learns experimental design as a sequential policy. Our key insight is that while absolute information gains diminish as knowledge accumulates (making value-based RL […]


technocracy

AI Startup ‘Sparkli’ Wants to Make Kids’ Access to AI Safer and More Interactive With an App

digitado ⋅ January 22, 2026

These days, most big tech companies and startups build AI tools that are useful for adults and put guardrails in place to protect kids. So the question is: is any company working on safer AI access for kids in a way that isn’t limited to text or voice?

Sparkli believes in AI that’s safer and interactive for kids

Well, it seems one company is actually doing something about bringing generative AI […]


technocracy

Iterative Quantum Feature Maps

digitado ⋅ March 9, 2026

arXiv:2506.19461v3. Quantum machine learning models that leverage quantum circuits as quantum feature maps (QFMs) are recognized for their enhanced expressive power in learning tasks. Such models have demonstrated rigorous end-to-end quantum speedups for specific families of classification problems. However, deploying deep QFMs on real quantum hardware remains challenging due to circuit noise and hardware constraints. Additionally, variational quantum algorithms often suffer from computational bottlenecks, particularly in accurate gradient estimation, which significantly increases quantum resource […]


technocracy

ByteDance stuns the AI video world

digitado ⋅ February 10, 2026

Good morning, AI enthusiasts. China’s AI labs are on a tear in the video space, and ByteDance’s Seedance 2.0 might be the most impressive entry yet. With viral examples coming out of its beta across a range of styles and use cases that look stronger than anything available, the TikTok parent is making a serious case that the next creative leap in AI video is coming […]


technocracy

When More Experts Hurt: Underfitting in Multi-Expert Learning to Defer

digitado ⋅ February 19, 2026

Learning to Defer (L2D) enables a classifier to abstain from predictions and defer to an expert, and has recently been extended to multi-expert settings. In this work, we show that multi-expert L2D is fundamentally more challenging than the single-expert case. With multiple experts, the classifier’s underfitting becomes inherent, which seriously degrades prediction performance, whereas in the single-expert setting it arises only under specific conditions. We theoretically reveal that this stems from an intrinsic expert identifiability issue: learning which […]


technocracy

Uncertainty-Aware Classifier with Physics-Based Rejection (UA-PBR): A Proof-of-Concept Under Computational Constraints—Revised Version

digitado ⋅ March 11, 2026

Deep learning classifiers deployed in scientific applications often encounter inputs that violate physical laws (e.g., due to sensor failure or corruption). Standard methods cannot detect such violations and may produce confident but wrong predictions. We propose UA-PBR, a framework that combines a physics-informed autoencoder (to detect physics violations) with a Bayesian CNN (to quantify predictive uncertainty). Inputs are rejected if either the PDE residual exceeds a threshold or the predictive entropy is too high. As a proof-of-concept, we […]
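
The rejection rule described in this teaser (reject when either the PDE residual or the predictive entropy is too high) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the threshold values, and the sample probabilities are hypothetical, and the PDE residual is assumed to be already computed as a single scalar.

```python
import math

def predictive_entropy(probs):
    """Shannon entropy (in nats) of a categorical predictive distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def should_reject(pde_residual, probs, residual_threshold, entropy_threshold):
    """Reject an input if EITHER criterion fires.

    pde_residual: scalar physics-violation score (assumed precomputed).
    probs: class probabilities, e.g. averaged over Bayesian forward passes.
    Both thresholds are hypothetical tuning parameters.
    """
    too_unphysical = pde_residual > residual_threshold
    too_uncertain = predictive_entropy(probs) > entropy_threshold
    return too_unphysical or too_uncertain

# Illustrative values only; none of these numbers come from the paper.
confident = [0.97, 0.02, 0.01]
uncertain = [0.34, 0.33, 0.33]
print(should_reject(0.01, confident, 0.1, 0.5))  # False: both checks pass
print(should_reject(0.01, uncertain, 0.1, 0.5))  # True: entropy too high
print(should_reject(0.50, confident, 0.1, 0.5))  # True: residual too high
```

Combining the two checks with a logical OR means a physically implausible input is rejected even when the classifier is confident, which is the failure case the teaser highlights.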


technocracy

4DPC$^2$hat: Towards Dynamic Point Cloud Understanding with Failure-Aware Bootstrapping

digitado ⋅ February 5, 2026

arXiv:2602.03890v1. Point clouds provide a compact and expressive representation of 3D objects, and have recently been integrated into multimodal large language models (MLLMs). However, existing methods primarily focus on static objects, while understanding dynamic point cloud sequences remains largely unexplored. This limitation is mainly caused by the lack of large-scale cross-modal datasets and the difficulty of modeling motions in spatio-temporal contexts. To bridge this gap, we present 4DPC$^2$hat, the first MLLM tailored for dynamic […]


technocracy

Flexible Cutoff Learning: Optimizing Machine Learning Potentials After Training

digitado ⋅ March 10, 2026

We introduce Flexible Cutoff Learning (FCL), a method for training machine learning interatomic potentials (MLIPs) whose cutoff radii can be adjusted after training. Unlike conventional MLIPs that fix the cutoff radius during training, FCL models are trained by randomly sampling cutoff radii independently for each atom. The resulting model can then be deployed with different per-atom cutoff radii depending on the application, enabling application-specific optimization of the accuracy-cost tradeoff. Using a differentiable cost model, these per-atom cutoffs can […]


technocracy

General Bayesian Policy Learning

digitado ⋅ February 27, 2026

This study proposes the General Bayes framework for policy learning. We consider decision problems in which a decision-maker chooses an action from an action set to maximize its expected welfare. Typical examples include treatment choice and portfolio selection. In such problems, the statistical target is a decision rule, and the prediction of each outcome $Y(a)$ is not necessarily of primary interest. We formulate this policy learning problem by loss-based Bayesian updating. Our main technical device is a squared-loss […]
