digitado

A Theoretical Framework for Modular Learning of Robust Generative Models

digitado ⋅ 19 February 2026

Training large-scale generative models is resource-intensive and relies heavily on heuristic dataset weighting. We address two fundamental questions: Can we train Large Language Models (LLMs) modularly (combining small, domain-specific experts to match monolithic performance), and can we do so robustly for any data mixture, eliminating heuristic tuning? We present a theoretical framework for modular generative modeling in which a set of pre-trained experts is combined via a gating mechanism. We define the space of normalized gating functions, $G_{1}$, and formulate the […]

Top-k on a Budget: Adaptive Ranking with Weak and Strong Oracles

digitado ⋅ 30 January 2026

arXiv:2601.20989v1 (announce type: new). Abstract: Identifying the top-$k$ items is fundamental but often prohibitive when exact valuations are expensive. We study a two-oracle setting with a fast, noisy weak oracle and a scarce, high-fidelity strong oracle (e.g., human expert verification or expensive simulation). We first analyze a simple screen-then-certify baseline (STC) and prove it makes at most $m(4\varepsilon_{\max})$ strong calls given jointly valid weak confidence intervals with maximum radius $\varepsilon_{\max}$, where $m(\cdot)$ denotes the near-tie mass around the […]
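
The screen-then-certify idea can be illustrated with a small sketch. This is only one plausible reading of the truncated abstract, not the paper's STC procedure: the function name `screen_then_certify`, the screening rule, and the toy data are all assumptions made for illustration. Weak confidence intervals are used to settle items that are clearly inside or outside the top-$k$, and the strong oracle is called only on the items that remain ambiguous.

```python
def screen_then_certify(weak_means, radii, k, strong_oracle):
    """Hypothetical sketch: screen with weak intervals, certify ties with strong calls.

    Assumes the intervals [mean - radius, mean + radius] all contain the
    true values at once (jointly valid) and that true values have no exact ties.
    """
    n = len(weak_means)
    lower = [m - r for m, r in zip(weak_means, radii)]
    upper = [m + r for m, r in zip(weak_means, radii)]

    certain_in, ambiguous = [], []
    for i in range(n):
        # Items that could possibly have a larger true value than item i.
        could_beat = sum(1 for j in range(n) if j != i and upper[j] > lower[i])
        # Items that certainly have a larger true value than item i.
        surely_beat = sum(1 for j in range(n) if j != i and lower[j] > upper[i])
        if could_beat < k:
            certain_in.append(i)   # at most k-1 rivals: guaranteed top-k
        elif surely_beat >= k:
            continue               # at least k better items: guaranteed out
        else:
            ambiguous.append(i)    # near-tie: needs the strong oracle

    # Spend strong calls only on the ambiguous items.
    strong_values = {i: strong_oracle(i) for i in ambiguous}
    remaining = k - len(certain_in)
    best = sorted(ambiguous, key=lambda i: strong_values[i], reverse=True)[:remaining]
    return sorted(certain_in + best), len(strong_values)


# Toy data (made up): true values are hidden behind the strong oracle.
true_values = [0.9, 0.8, 0.52, 0.48, 0.1]
weak_means = [0.88, 0.79, 0.50, 0.51, 0.12]
radii = [0.05] * 5

top, strong_calls = screen_then_certify(
    weak_means, radii, k=3, strong_oracle=lambda i: true_values[i]
)
print(top, strong_calls)
```

In this toy instance only the two items whose weak intervals overlap around the top-3 boundary trigger strong calls, which matches the intuition that the strong-call budget scales with the mass of near-ties rather than with the number of items.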

OpenAI introduces GPT-5.4 with more knowledge-work capability

digitado ⋅ 5 March 2026

In keeping with its recently accelerated release cadence, OpenAI has shipped GPT-5.4 (including GPT-5.4 Thinking and GPT-5.4 Pro). This update comes at a critical time, as recent events have led some vocal users to abandon ship for competing products and models from Anthropic and Google. GPT-5.4 is another model update focused on usefulness for agentic tasks, particularly knowledge work. OpenAI says this is its first model explicitly aimed at computer-use tasks; like competing models, it can issue keyboard […]

Learning to accelerate Krasnosel’skii-Mann fixed-point iterations with guarantees

digitado ⋅ 12 January 2026

We introduce a principled learning-to-optimize (L2O) framework for solving fixed-point problems involving general nonexpansive mappings. Our idea is to deliberately inject summable perturbations into a standard Krasnosel’skii-Mann iteration to improve its average-case performance over a specific distribution of problems while retaining its convergence guarantees. Under a metric sub-regularity assumption, we prove that the proposed parametrization includes only iterations that locally achieve linear convergence (up to a vanishing bias term) and that it encompasses all iterations that do so […]
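
For readers unfamiliar with the underlying scheme, here is a minimal sketch of a Krasnosel’skii-Mann iteration $x_{k+1} = (1-\lambda)x_k + \lambda T(x_k) + e_k$ with a summable perturbation sequence $e_k$. It is not the paper's learned parametrization: the perturbation here is a fixed, hand-picked sequence with norm proportional to $1/(k+1)^2$, and the names `km_iterate`, `rotate90`, and `summable_perturbation` are hypothetical. The example map is a 90-degree rotation of the plane, which is nonexpansive with unique fixed point at the origin.

```python
import math


def km_iterate(T, x0, lam=0.5, perturb=None, iters=200):
    """Krasnosel'skii-Mann iteration with an optional additive perturbation.

    Update: x <- (1 - lam) * x + lam * T(x) + e_k, where e_k = perturb(k, x).
    """
    x = list(x0)
    for k in range(iters):
        Tx = T(x)
        e = perturb(k, x) if perturb else [0.0] * len(x)
        x = [(1 - lam) * xi + lam * ti + ei for xi, ti, ei in zip(x, Tx, e)]
    return x


def rotate90(x):
    # Rotation by 90 degrees: an isometry, hence nonexpansive; fixed point is 0.
    return [-x[1], x[0]]


def summable_perturbation(k, x):
    # Hand-picked stand-in for a learned perturbation; its norms sum to a
    # finite value because they decay like 1 / (k + 1)^2.
    scale = 1.0 / (k + 1) ** 2
    return [scale, -scale]


x_plain = km_iterate(rotate90, [1.0, 0.0])
x_pert = km_iterate(rotate90, [1.0, 0.0], perturb=summable_perturbation)

print(math.hypot(*x_plain))  # distance to the fixed point without perturbation
print(math.hypot(*x_pert))   # distance to the fixed point with perturbation
```

Iterating the rotation alone would circle forever, while the averaged update contracts toward the fixed point. The perturbed run still approaches the origin because the injected terms shrink fast enough to be summable, which is the property the abstract relies on to retain convergence guarantees.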

StealthRL: Reinforcement Learning Paraphrase Attacks for Multi-Detector Evasion of AI-Text Detectors

digitado ⋅ 9 February 2026

AI-text detectors face a critical robustness challenge: adversarial paraphrasing attacks that preserve semantics while evading detection. We introduce StealthRL, a reinforcement learning framework that stress-tests detector robustness under realistic adversarial conditions. StealthRL trains a paraphrase policy against a multi-detector ensemble using Group Relative Policy Optimization (GRPO) with LoRA adapters on Qwen3-4B, optimizing a composite reward that balances detector evasion with semantic preservation. We evaluate six attack settings (M0-M5) against three detector families (RoBERTa, FastDetectGPT, and Binoculars) at the […]

Constraint- and Score-Based Nonlinear Granger Causality Discovery with Kernels

digitado ⋅ 15 January 2026

arXiv:2601.09579v1 (announce type: cross). Abstract: Kernel-based methods are used in the context of Granger Causality to enable the identification of nonlinear causal relationships between time series variables. In this paper, we show that two state-of-the-art kernel-based Granger Causality (GC) approaches can be theoretically unified under the framework of Kernel Principal Component Regression (KPCR), and introduce a method based on this unification, demonstrating that this approach can improve causal identification. Additionally, we introduce a Gaussian Process […]

M-polynomial Based Mathematical Formulation of the Hyperbolic Sombor Index

digitado ⋅ 18 February 2026

arXiv:2602.15086v1 (announce type: new). Abstract: The numerical values extracted from a graph that indicate its topology are called topological indices. A contemporary and efficient method is to compute a graph’s topological indices using the graph polynomial that corresponds to it. This method of identifying degree-based topological indices involves the use of the M-polynomial. Very recently, in 2025, the hyperbolic Sombor index (HSO) was proposed, showing chemical applicability for octane isomers and the structure sensitivity and abruptness […]

Executable Ontologies in Game Development: From Algorithmic Control to Semantic World Modeling

digitado ⋅ 14 January 2026

arXiv:2601.07964v1 (announce type: new). Abstract: This paper examines the application of Executable Ontologies (EO), implemented through the boldsea framework, to game development. We argue that EO represents a paradigm shift: a transition from algorithmic behavior programming to semantic world modeling, where agent behavior emerges naturally from declarative domain rules rather than being explicitly coded. Using a survival game scenario (Winter Feast), we demonstrate how EO achieves priority-based task interruption through dataflow conditions rather than explicit preemption logic. Comparison […]

Fairness under Graph Uncertainty: Achieving Interventional Fairness with Partially Known Causal Graphs over Clusters of Variables

digitado ⋅ 2 March 2026

arXiv:2602.23611v1 (announce type: new). Abstract: Algorithmic decisions about individuals require predictions that are not only accurate but also fair with respect to sensitive attributes such as gender and race. Causal notions of fairness align with legal requirements, yet many methods assume access to detailed knowledge of the underlying causal graph, which is a demanding assumption in practice. We propose a learning framework that achieves interventional fairness by leveraging a causal graph over clusters of variables, which is substantially […]

Applying the maximum entropy principle to neural networks enhances multi-species distribution models

digitado ⋅ 14 January 2026

arXiv:2412.19217v4 (announce type: replace-cross). Abstract: The rapid expansion of citizen science initiatives has led to significant growth in biodiversity databases, particularly in presence-only (PO) observations. PO data are invaluable for understanding species distributions and their dynamics, but their use in a Species Distribution Model (SDM) is curtailed by sampling biases and the lack of information on absences. Poisson point processes are widely used for SDMs, with Maxent being one of the most popular methods. Maxent maximises the […]
