Boosting Maximum Entropy Reinforcement Learning via One-Step Flow Matching
Diffusion policies are expressive yet incur high inference latency. Flow Matching (FM) enables one-step generation, but integrating it into Maximum Entropy Reinforcement Learning (MaxEnt RL) is challenging: the optimal policy is an intractable energy-based distribution, and the efficient log-likelihood estimation required to balance exploration and exploitation suffers from severe discretization bias. We propose \textbf{F}low-based \textbf{L}og-likelihood-\textbf{A}ware \textbf{M}aximum \textbf{E}ntropy RL (\textbf{FLAME}), a principled framework that addresses these challenges. First, we derive a Q-Reweighted FM objective that bypasses partition function estimation […]
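For reference, the intractability noted in the abstract comes from the standard MaxEnt RL optimum; a minimal sketch in the usual notation (soft Q-function $Q$, temperature $\alpha$, action space $\mathcal{A}$, none of which are defined in this excerpt):

% Energy-based optimal policy; Z(s) is the partition function over continuous actions,
% which is intractable to compute exactly and motivates bypassing its estimation.
\[
  \pi^{*}(a \mid s) \;=\; \frac{\exp\!\big(Q(s,a)/\alpha\big)}{Z(s)},
  \qquad
  Z(s) \;=\; \int_{\mathcal{A}} \exp\!\big(Q(s,a')/\alpha\big)\, \mathrm{d}a'.
\]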