digitado

Smoothing DiLoCo with Primal Averaging for Faster Training of LLMs

digitado ⋅ March 2, 2026

arXiv:2512.17131v3. Abstract: We propose Generalized Primal Averaging (GPA), an extension of Nesterov’s method that unifies and generalizes recent averaging-based optimizers like single-worker DiLoCo and Schedule-Free, within a non-distributed setting. While DiLoCo relies on a memory-intensive two-loop structure to periodically aggregate pseudo-gradients using Nesterov momentum, GPA eliminates this complexity by decoupling Nesterov’s interpolation constants to enable smooth iterate averaging at every step. Structurally, GPA resembles Schedule-Free but replaces uniform averaging with exponential moving averaging. Empirically, GPA […]
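The contrast the abstract draws between uniform averaging and exponential moving averaging of iterates can be illustrated with a minimal sketch. This is not GPA itself: the function names and the `beta` parameter are assumptions made for illustration, and the iterates are plain floats rather than model parameters.

```python
def uniform_average(iterates):
    """Running uniform average: every past iterate gets equal weight 1/t."""
    avg = 0.0
    history = []
    for t, x in enumerate(iterates, start=1):
        avg += (x - avg) / t  # incremental form of the arithmetic mean
        history.append(avg)
    return history


def ema_average(iterates, beta=0.9):
    """Exponential moving average: older iterates decay geometrically.

    `beta` is a hypothetical smoothing constant, not a value from the paper.
    The average is initialised to the first iterate.
    """
    avg = iterates[0]
    history = []
    for x in iterates:
        avg = beta * avg + (1.0 - beta) * x
        history.append(avg)
    return history


uniform = uniform_average([1.0, 2.0, 3.0])
ema = ema_average([1.0, 3.0], beta=0.5)
```

With uniform weights the influence of early iterates never fades, whereas the exponential form forgets them at a rate set by `beta`.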


technocracy

ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning

digitado ⋅ February 2, 2026

Reinforcement learning (RL) is a critical stage in post-training large language models (LLMs), involving repeated interaction between rollout generation, reward evaluation, and centralized learning. Distributing rollout execution offers opportunities to leverage more cost-efficient inference resources, but introduces challenges in wide-area coordination and policy dissemination. We present ECHO-2, a distributed RL framework for post-training with remote inference workers and non-negligible dissemination latency. ECHO-2 combines centralized learning with distributed rollouts and treats bounded policy staleness as a user-controlled parameter, enabling […]
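One way to read "bounded policy staleness as a user-controlled parameter" is as a filter on how many policy versions a rollout may lag behind the learner. The sketch below is an assumption about that idea, not ECHO-2's actual mechanism: `Rollout`, `accept_rollouts`, and `max_staleness` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class Rollout:
    policy_version: int  # version of the policy that generated this rollout
    payload: str         # placeholder for the trajectory data


def accept_rollouts(rollouts, learner_version, max_staleness):
    """Keep rollouts whose policy lags the learner by at most `max_staleness` versions."""
    return [
        r for r in rollouts
        if 0 <= learner_version - r.policy_version <= max_staleness
    ]


batch = [Rollout(10, "a"), Rollout(8, "b"), Rollout(5, "c")]
fresh = accept_rollouts(batch, learner_version=10, max_staleness=2)
```

A larger `max_staleness` tolerates slower policy dissemination to remote workers at the cost of training on more off-policy data.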


technocracy

Using Small Language Models to Reverse-Engineer Machine Learning Pipelines Structures

digitado ⋅ January 7, 2026

Background: Extracting the stages that structure Machine Learning (ML) pipelines from source code is key for gaining a deeper understanding of data science practices. However, the diversity caused by the constant evolution of the ML ecosystem (e.g., algorithms, libraries, datasets) makes this task challenging. Existing approaches either depend on non-scalable, manual labeling, or on ML classifiers that do not properly support the diversity of the domain. These limitations highlight the need for more flexible and reliable solutions. Objective: […]


technocracy

Engaging the AI community through building, research, and shared learning

digitado ⋅ February 2, 2026

Machine learning ⋅ Staff writer ⋅ February 02, 02:51 PM ⋅ February 02, 03:34 PM

Advancing AI requires more than breakthrough models. It depends on communities of builders and researchers who experiment, test assumptions, and share what they learn. That belief is guiding how Amazon engages developers and academics around Amazon Nova, Amazon's portfolio of AI offerings, including the Nova models, Nova Forge, and Nova Act. Today, two Nova initiatives launch […]


technocracy

Why MP4 Video Is Broken for the Modern Web

digitado ⋅ January 5, 2026

MP4 video was built for broadcast, not for the dynamic, data-driven web. As video becomes the primary interface online, static files can’t personalize, adapt, or respond in real time. Blings proposes a runtime video model (MP5) where video is assembled on the fly like software—interactive, composable, privacy-first, and personalized for every viewer.


technocracy

What do you think about this paper on Computer-Using World Model?

digitado ⋅ February 21, 2026

I’m talking about the claims in this RL paper. I personally like it, but I dispute how they justify the structure-aware reinforcement learning for textual transitions. I do like the world-model-guided test-time action search.

Paper: https://arxiv.org/pdf/2602.17365

My comments: https://trybibby.com/view/project/4395c445-477b-439e-b7e6-5b8b24734e92

https://preview.redd.it/3utmvy2t3ukg1.png?width=1953&format=png&auto=webp&s=7fd99059c883336e35d64c64d7bcec37c9988f6e

Would love to know your thoughts on the paper.

submitted by /u/nilofering


technocracy

Adaptive Workflow Allocation in Human–Machine Cooperative Anti-Money Laundering Operations

digitado ⋅ December 31, 2025

This study develops an adaptive workflow allocation mechanism for anti-money laundering (AML) operations, aiming to improve the accuracy and efficiency of suspicious-transaction review. A multi-agent simulation platform was constructed to model transaction flows, alert generation, and analyst decision behaviors. The system integrates model-confidence estimation, analyst-fatigue prediction, and real-time workload signals to dynamically route alerts. Experiments were conducted using 27.3 million historical transactions and 186,000 alerts from a large commercial financial dataset. Compared with fixed allocation rules, the adaptive […]


technocracy

A Survey of Popular LLM Evaluation Metrics

digitado ⋅ August 20, 2025

Large Language Models (LLMs) are increasingly applied to critical domains such as medical report generation, where accuracy and trust are essential. Evaluating the quality of generated text is non-trivial: surface word matches may miss key semantic errors, while semantic metrics may overlook domain-specific mistakes. This review goes through five categories of evaluation metrics, using a consistent medical example to illustrate their differences: Reference report: “The chest X-ray shows evidence of pneumonia. No pleural effusion is present.” Generated report: […]
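The point that surface word matches may miss key semantic errors can be illustrated with a minimal unigram-overlap sketch. The candidate sentence below is a hypothetical example written for this illustration; it is not the generated report from the article, which is truncated above. The tokenizer and function names are likewise assumptions.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split into word tokens, keeping hyphenated words such as 'x-ray'."""
    return re.findall(r"[a-z0-9-]+", text.lower())


def unigram_overlap(reference, candidate):
    """Return (precision, recall) of clipped unigram matches between two texts."""
    ref_counts = Counter(tokenize(reference))
    cand_counts = Counter(tokenize(candidate))
    # Each candidate token is credited at most as often as it occurs in the reference.
    matched = sum(min(n, ref_counts[tok]) for tok, n in cand_counts.items())
    precision = matched / sum(cand_counts.values())
    recall = matched / sum(ref_counts.values())
    return precision, recall


reference = "The chest X-ray shows evidence of pneumonia. No pleural effusion is present."
# Hypothetical candidate: a single inserted "no" reverses the clinical finding.
hypothetical = "The chest X-ray shows no evidence of pneumonia. No pleural effusion is present."

precision, recall = unigram_overlap(reference, hypothetical)
```

Here the overlap scores stay high (precision 12/13, recall 12/12) even though the candidate states the opposite diagnosis, which is the failure mode of purely lexical metrics.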


technocracy

A Peek at Trends in Machine Learning

digitado ⋅ April 7, 2017

Have you looked at Google Trends? It’s pretty cool — you enter some keywords and see how Google Searches of that term vary through time. I thought — hey, I happen to have this arxiv-sanity database of 28,303 (arxiv) Machine Learning papers over the last 5 years, so why not do something similar and take a look at how Machine Learning research has evolved over the last 5 years? The results are fairly fun, so I thought I’d post. (Edit: machine learning is […]


technocracy

Trump orders the military to make agreements with coal power plants

digitado ⋅ February 12, 2026

On Wednesday, a fossil-fuel lobbying group called the Washington Coal Club awarded President Trump a trophy that named him the “Undisputed Champion of Clean, Beautiful Coal.” Trump took advantage of the opportunity to take his latest shot at reviving the fortunes of the US’s most polluting source of electricity: an executive order that would make the military buy it. Coal is the second most expensive source of power for the US grid, eclipsed by gas, wind, solar, hydro—everything […]
