digitado

CAGE: Bridging the Accuracy-Aesthetics Gap in Educational Diagrams via Code-Anchored Generative Enhancement

digitado ⋅ 15 de April de 2026

arXiv:2604.09691v1 Announce Type: new Abstract: Educational diagrams — labeled illustrations of biological processes, chemical structures, physical systems, and mathematical concepts — are essential cognitive tools in K-12 instruction. Yet no existing method can generate them both accurately and engagingly. Open-source diffusion models produce visually rich images but catastrophically garble text labels. Code-based generation via LLMs guarantees label correctness but yields visually flat outputs. Closed-source APIs partially bridge this gap but remain unreliable and prohibitively expensive at educational scale. […]

Ver mais

Like 0

Liked Liked

technocracy

Get working on your April Fools Eiffel Tower

digitado ⋅ 1 de April de 2026

Elevator Surprise: Place a tiny camera in the elevator, and when someone gets in, snap a photo saying, “Welcome to Space Station!” Or build a miniature model of the Eiffel Tower next to it for a dramatic effect. Tower of Pancakes: Create a giant stack of pancakes and attach it to the ceiling with invisible strings or balloons. They won’t believe it’s real! Lost Cat: Suspend a toy cat from the ceiling using fishing lines. Watch as friends […]

Ver mais

Like 0

Liked Liked

technocracy

FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies

digitado ⋅ 29 de March de 2026

Thanks to their remarkable flexibility, diffusion models and flow models have emerged as promising candidates for policy representation. However, efficient reinforcement learning (RL) upon these policies remains a challenge due to the lack of explicit log-probabilities for vanilla policy gradient estimators. While numerous attempts have been proposed to address this, the field lacks a unified perspective to reconcile these seemingly disparate methods, thus hampering ongoing development. In this paper, we bridge this gap by introducing a comprehensive taxonomy […]

Ver mais

Like 0

Liked Liked

technocracy

Neural Network Tuning of FSMPC for Drives

digitado ⋅ 11 de March de 2026

arXiv:2603.08816v1 Announce Type: new Abstract: This preprint presents a neural network tuner for the finite state model predictive control of an induction motor. The tuner deals with the parameters of the controllers in the speed loop and in the stator current loop. The results are assessed using a five phase machine in an experimental setup. Data for the neural network training is obtained from the experiments using step tests.

Ver mais

Like 0

Liked Liked

technocracy

Escaping Model Collapse via Synthetic Data Verification: Near-term Improvements and Long-term Convergence

digitado ⋅ 9 de March de 2026

arXiv:2510.16657v2 Announce Type: replace Abstract: Synthetic data has been increasingly used to train frontier generative models. However, recent studies raise key concerns that iteratively retraining a generative model on its self-generated synthetic data may keep deteriorating model performance, a phenomenon often coined model collapse. In this paper, we investigate ways to modify the synthetic retraining process to avoid model collapse, and even possibly help reverse the trend from collapse to improvement. Our key finding is that by injecting […]

Ver mais

Like 0

Liked Liked

technocracy

How I Beat Context Rot and Saved 6 Out of 47 AI Agents in Production

digitado ⋅ 5 de June de 2026

Late 2025 through early 2026 was one of the most intense periods of my career. I was deep in the trenches, designing and deploying AI agents for real enterprise clients in fintech, healthcare payments, and compliance. Out of 47 agents we pushed into production, only 6 are still delivering consistent value today. The difference wasn’t better LLMs or more clever prompting. It came down to figuring out how to fight something I started calling Context Rot. As I […]

Ver mais

Like 0

Liked Liked

technocracy

A Unifying View of Coverage in Linear Off-Policy Evaluation

digitado ⋅ 28 de January de 2026

arXiv:2601.19030v1 Announce Type: cross Abstract: Off-policy evaluation (OPE) is a fundamental task in reinforcement learning (RL). In the classic setting of linear OPE, finite-sample guarantees often take the form $$ textrm{Evaluation error} le textrm{poly}(C^pi, d, 1/n,log(1/delta)), $$ where $d$ is the dimension of the features and $C^pi$ is a coverage parameter that characterizes the degree to which the visited features lie in the span of the data distribution. While such guarantees are well-understood for several popular algorithms under […]

Ver mais

Like 0

Liked Liked

technocracy

DREAM: Dual-Standard Semantic Homogeneity with Dynamic Optimization for Graph Learning with Label Noise

digitado ⋅ 24 de January de 2026

Graph neural networks (GNNs) have been widely used in various graph machine learning scenarios. Existing literature primarily assumes well-annotated training graphs, while the reliability of labels is not guaranteed in real-world scenarios. Recently, efforts have been made to address the problem of graph learning with label noise. However, existing methods often (i) struggle to distinguish between reliable and unreliable nodes, and (ii) overlook the relational information embedded in the graph topology. To tackle this problem, this paper proposes […]

Ver mais

Like 0

Liked Liked

technocracy

Towards Verified and Targeted Explanations through Formal Methods

digitado ⋅ 18 de April de 2026

arXiv:2604.14209v1 Announce Type: new Abstract: As deep neural networks are deployed in safety-critical domains such as autonomous driving and medical diagnosis, stakeholders need explanations that are interpretable but also trustworthy with formal guarantees. Existing XAI methods fall short: heuristic attribution techniques (e.g., LIME, Integrated Gradients) highlight influential features but offer no mathematical guarantees about decision boundaries, while formal methods verify robustness yet remain untargeted, analyzing the nearest boundary regardless of whether it represents a critical risk. In safety-critical […]

Ver mais

Like 0

Liked Liked

technocracy

How Much Temporal Modeling is Enough? A Systematic Study of Hybrid CNN-RNN Architectures for Multi-Label ECG Classification

digitado ⋅ 28 de January de 2026

arXiv:2601.18830v1 Announce Type: new Abstract: Accurate multi-label classification of electrocardiogram (ECG) signals remains challenging due to the coexistence of multiple cardiac conditions, pronounced class imbalance, and long-range temporal dependencies in multi-lead recordings. Although recent studies increasingly rely on deep and stacked recurrent architectures, the necessity and clinical justification of such architectural complexity have not been rigorously examined. In this work, we perform a systematic comparative evaluation of convolutional neural networks (CNNs) combined with multiple recurrent configurations, including LSTM, […]

Ver mais

Like 0

Liked Liked