digitado

Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design

digitado ⋅ 4 de February de 2026

Reinforcement learning has been widely applied to diffusion and flow models for visual tasks such as text-to-image generation. However, these tasks remain challenging because diffusion models have intractable likelihoods, which creates a barrier for directly applying popular policy-gradient type methods. Existing approaches primarily focus on crafting new objectives built on already heavily engineered LLM objectives, using ad hoc estimators for likelihood, without a thorough investigation into how such estimation affects overall algorithmic performance. In this work, we provide […]

Ver mais

Like 0

Liked Liked

technocracy

This Is What a Production RAG Stack Actually Looks Like

digitado ⋅ 23 de May de 2026

The failures usually start earlier and later: bad parsing, sloppy chunks, stale metadata, duplicate context, missing evals, and no… Continue reading on Towards AI »

Ver mais

Like 0

Liked Liked

technocracy

MXFormer: A Microscaling Floating-Point Charge-Trap Transistor Compute-in-Memory Transformer Accelerator

digitado ⋅ 16 de February de 2026

arXiv:2602.12480v1 Announce Type: new Abstract: The proliferation of Transformer models is often constrained by the significant computational and memory bandwidth demands of deployment. To address this, we present MXFormer, a novel, hybrid, weight-stationary Compute-in-Memory (CIM) accelerator that provides high throughput and efficiency for fixed-model inference on large short-sequence Transformers. Our architecture’s foundation is the use of ultra-dense Charge-Trap Transistors (CTTs) in Microscaling MXFP4 CIM arrays, uniquely enabling the on-chip storage of up to hundreds of millions of parameters […]

Ver mais

Like 0

Liked Liked

technocracy

Trump picks qualified, normal health leader to head CDC; experts still cautious

digitado ⋅ 17 de April de 2026

President Trump on Thursday announced his third nominee for director of the Centers for Disease Control and Prevention: Dr. Erica Schwartz, a well-qualified former public health official and board-certified physician in preventive medicine, who has publicly supported vaccination and followed evidence-based medicine. The uncontroversial pick comes amid concern within the administration that the aggressive anti-vaccine agenda from Health Secretary Robert F. Kennedy Jr.—who has no medical, science, or public health background—has become a liability for the party in […]

Ver mais

Like 0

Liked Liked

technocracy

Load Block Modeling in Distribution Systems: Network Reconfiguration for Load Restoration

digitado ⋅ 20 de April de 2026

arXiv:2604.15480v1 Announce Type: new Abstract: The distribution system restoration (DSR) problem has received considerable attention over the last decade or more. Solutions to the DSR problem identify the best set or sequence of actions to perform on a distribution circuit to restore service after a disruption. The problem is challenging from a computational perspective, with engineering constraints specific to distribution systems, such as radial operations, that are difficult to effectively model. In this paper, we revisit the model […]

Ver mais

Like 0

Liked Liked

technocracy

A Semi-Automated Framework for 3D Reconstruction of Medieval Manuscript Miniatures

digitado ⋅ 13 de April de 2026

arXiv:2604.08610v1 Announce Type: new Abstract: This paper presents a semi-automated framework for transforming two-dimensional miniatures from medieval manuscripts into three-dimensional digital models suitable for extended reality (XR), tactile 3D~printing, and web-based visualization. We evaluate seven image-to-3D methods (TripoSR, SF3D, SPAR3D, TRELLIS, Wonder3D, SAM~3D, Hi3DGen) on 69~manuscript figures from two collections using rendering-based metrics (Silhouette IoU, LPIPS, CLIP~Score) and volumetric measures (Depth Range Ratio, watertight percentage), revealing a trade-off between volumetric expansion and geometric fidelity. Hi3DGen balances topological quality […]

Ver mais

Like 0

Liked Liked

technocracy

I just updated my RL notes!

digitado ⋅ 25 de June de 2026

https://github.com/roboticcam/machine-learning-notes It included both the foundational knowledge such as policy gradient theorem as well as the latest such as GRPO. submitted by /u/Delicious_Screen_789 [link] [comments]

Ver mais

Like 0

Liked Liked

technocracy

Sharpness of Minima in Deep Matrix Factorization

digitado ⋅ 29 de January de 2026

arXiv:2509.25783v5 Announce Type: replace Abstract: Understanding the geometry of the loss landscape near a minimum is key to explaining the implicit bias of gradient-based methods in non-convex optimization problems such as deep neural network training and deep matrix factorization. A central quantity to characterize this geometry is the maximum eigenvalue of the Hessian of the loss. Currently, its precise role has been obfuscated because no exact expressions for this sharpness measure were known in general settings. In this […]

Ver mais

Like 0

Liked Liked

technocracy

Introducing Provable Randomness in Beldex Consensus with Verifiable Random Functions

digitado ⋅ 5 de February de 2026

The concept of Verifiable Random Functions (VRFs) was first introduced over two decades ago in 1999, by cryptographers Silvio Micali, Michael O. Rabin, and Salil Vadhan. Back in the day, it was considered a generational breakthrough in cryptography. It gave cryptographers a way to produce randomness that anyone can verify, yet no one can predict or manipulate. Today, the same VRFs, with a little bit of modification, are at the center of modern blockchain networks, ensuring fair, secure, […]

Ver mais

Like 0

Liked Liked

technocracy

Examining Fast Radiative Feedbacks Using Machine-Learning Weather Emulators

digitado ⋅ 18 de February de 2026

The response of the climate system to increased greenhouse gases and other radiative perturbations is governed by a combination of fast and slow feedbacks. Slow feedbacks are typically activated in response to changes in ocean temperatures on decadal timescales and manifest as changes in climatic state with no recent historical analogue. However, fast feedbacks are activated in response to rapid atmospheric physical processes on weekly timescales, and they are already operative in the present-day climate. This distinction implies […]

Ver mais

Like 0

Liked Liked