digitado

On the Learning Dynamics of RLVR at the Edge of Competence

digitado ⋅ 17 de February de 2026

arXiv:2602.14872v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has been a main driver of recent breakthroughs in large reasoning models. Yet it remains a mystery how rewards based solely on final outcomes can help overcome the long-horizon barrier to extended reasoning. To understand this, we develop a theory of the training dynamics of RL for transformers on compositional reasoning tasks. Our theory characterizes how the effectiveness of RLVR is governed by the smoothness of the […]

Ver mais

Like 0

Liked Liked

technocracy

Helping power-system planners prepare for an unknown future

digitado ⋅ 3 de December de 2025

A new computer modeling tool developed by an MIT Energy Initiative (MITEI) research team will help infrastructure planners working in the electricity and other energy-intensive sectors better predict and prepare for future needs and conditions as they develop plans for power generation capacity, transmission lines, and other necessary infrastructure. The tool could reduce the amount of time this planning takes and help ensure that the power grid can continue to provide customers with efficient, reliable, and low-cost electricity […]

Ver mais

Like 0

Liked Liked

technocracy

Setting Up TensorFlow with GPU (CUDA): A Step-by-Step Installation Guide

digitado ⋅ 5 de January de 2026

Author(s): Muaaz Originally published on Towards AI. If you are writing Deep Learning code on a machine with a GPU, TensorFlow will default to running on the CPU. This happens because TensorFlow does not automatically select the best hardware. To use the GPU, you must specify it manually. To run TensorFlow code on a GPU, you don’t need any extra setup beyond installing the GPU-enabled version of TensorFlow. However, if you are using Windows, you must install Windows […]

Ver mais

Like 0

Liked Liked

technocracy

Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning

digitado ⋅ 4 de February de 2026

Low-Rank Adaptation (LoRA) is the prevailing approach for efficient large language model (LLM) fine-tuning. Building on this paradigm, recent studies have proposed alternative initialization strategies and architectural modifications, reporting substantial improvements over vanilla LoRA. However, these gains are often demonstrated under fixed or narrowly tuned hyperparameter settings, despite the known sensitivity of neural networks to training configurations. In this work, we systematically re-evaluate four representative LoRA variants alongside vanilla LoRA through extensive hyperparameter searches. Across mathematical and code […]

Ver mais

Like 0

Liked Liked

technocracy

How We Built a 99% Accurate Invoice Processing System Using OCR and LLMs

digitado ⋅ 8 de February de 2026

We had a working RAG solution at 91% accuracy. Here’s why we rebuilt it with fine-tuning and what we learned along the way. Our client was spending eight minutes per invoice on manual data entry. At 10,000 invoices a month, that’s a full team doing nothing but copying numbers from PDFs into a database. We were building an invoice processing system for a US healthcare client. The goal was straightforward — extract line items, medical codes, and billing information from unstructured […]

Ver mais

Like 0

Liked Liked

technocracy

2024 Year in Review

digitado ⋅ 10 de January de 2025

At the end of 2022 I wrapped up my contract work with Help Scout and took the plunge to work on my indie software businesses full time. I’m now two years into that adventure, and wanted to share a periodic update about how things are going. Preceden on the Back Burner I made 32 commits to Preceden, my timeline maker software, the entire year. Those commits were all small tweaks like switching the AI timeline generator from gpt-3.5-turbo […]

Ver mais

Like 0

Liked Liked

technocracy

Certified Unlearning in Decentralized Federated Learning

digitado ⋅ 10 de January de 2026

Driven by the right to be forgotten (RTBF), machine unlearning has become an essential requirement for privacy-preserving machine learning. However, its realization in decentralized federated learning (DFL) remains largely unexplored. In DFL, clients exchange local updates only with neighbors, causing model information to propagate and mix across the network. As a result, when a client requests data deletion, its influence is implicitly embedded throughout the system, making removal difficult without centralized coordination. We propose a novel certified unlearning […]

Ver mais

Like 0

Liked Liked

technocracy

Byzantine-Robust Optimization under $(L_0, L_1)$-Smoothness

digitado ⋅ 16 de March de 2026

arXiv:2603.12512v1 Announce Type: new Abstract: We consider distributed optimization under Byzantine attacks in the presence of $(L_0,L_1)$-smoothness, a generalization of standard $L$-smoothness that captures functions with state-dependent gradient Lipschitz constants. We propose Byz-NSGDM, a normalized stochastic gradient descent method with momentum that achieves robustness against Byzantine workers while maintaining convergence guarantees. Our algorithm combines momentum normalization with Byzantine-robust aggregation enhanced by Nearest Neighbor Mixing (NNM) to handle both the challenges posed by $(L_0,L_1)$-smoothness and Byzantine adversaries. We prove […]

Ver mais

Like 0

Liked Liked

technocracy

Boreas Road Trip: A Multi-Sensor Autonomous Driving Dataset on Challenging Roads

digitado ⋅ 20 de February de 2026

arXiv:2602.16870v1 Announce Type: new Abstract: The Boreas Road Trip (Boreas-RT) dataset extends the multi-season Boreas dataset to new and diverse locations that pose challenges for modern autonomous driving algorithms. Boreas-RT comprises 60 sequences collected over 9 real-world routes, totalling 643 km of driving. Each route is traversed multiple times, enabling evaluation in identical environments under varying traffic and, in some cases, weather conditions. The data collection platform includes a 5MP FLIR Blackfly S camera, a 360 degree Navtech […]

Ver mais

Like 0

Liked Liked

technocracy

Vector Databases: Unlocking the Future of Intelligent AI and Semantic Search

digitado ⋅ 15 de January de 2026

Author(s): Hayanan Originally published on Towards AI. How AI Learns Meaning, Not Just Keywords Modern AI syst‍ems are no longe‍r judged by how fast they retri‌e‍ve data, but by how w‍e‍ll t‍hey understand it. As use‍rs interact with applications in increas⁠ingly natural ways typing vague descriptions, asking open-en‌ded questions, or ev‌en up‍loadin‍g image‍s trad‍ition‍al keyword⁠-based sea‌rch quic‌kly reaches its limits. Exact ma‍tches fail whe‍n int‍ent is ambiguous, con‍text is im‌plicit, and‌ meaning goes beyond words. This gap between […]

Ver mais

Like 0

Liked Liked