digitado

SpaceX gets FCC permission to launch another 7,500 Starlink satellites

digitado ⋅ 10 de January de 2026

SpaceX today received US permission to launch another 7,500 second-generation Starlink satellites, bringing its total authorization to 15,000 Gen2 satellites including those previously approved. “Under this grant, SpaceX is authorized to construct, deploy, and operate an additional 7,500 Gen2 Starlink satellites, bringing the total to 15,000 satellites worldwide,” the Federal Communications Commission announced today. “This expansion will enable SpaceX to deliver high-speed, low-latency Internet service globally, including enhanced mobile and supplemental coverage from space.” The FCC gave SpaceX permission […]

Ver mais

Like 0

Liked Liked

technocracy

Robust AI Evaluation through Maximal Lotteries

digitado ⋅ 26 de February de 2026

arXiv:2602.21297v1 Announce Type: new Abstract: The standard way to evaluate language models on subjective tasks is through pairwise comparisons: an annotator chooses the “better” of two responses to a prompt. Leaderboards aggregate these comparisons into a single Bradley-Terry (BT) ranking, forcing heterogeneous preferences into a total order and violating basic social-choice desiderata. In contrast, social choice theory provides an alternative approach called maximal lotteries, which aggregates pairwise preferences without imposing any assumptions on their structure. However, we show […]

Ver mais

Like 0

Liked Liked

technocracy

Meta-Research on Backdoors: Dataset and Threat Model Shifts in Multimodal Backdoor Attacks

digitado ⋅ 7 de April de 2026

Backdoor attacks enable adversaries to embed malicious behavior into machine learning models by poisoning training data with triggers. Researchers focused largely on backdoors in unimodal models. However, the rise of multimodal systems, e.g., vision–language models (VLMs) and multimodal large language models (MLLMs), has significantly increased the attack surface. Multimodal backdoors can exploit cross-modal triggers, representation-level manipulation, instruction-conditioned behaviors, and test-time activation pathways that are not available in unimodal models. Nevertheless, quantifying progress in this field remains challenging due […]

Ver mais

Like 0

Liked Liked

technocracy

Why BTCC’s $5.7 Billion Gold Trading Surge Signals a Turning Point for Real-World Assets in Crypto

digitado ⋅ 16 de January de 2026

When cryptocurrency emerged with promises of replacing traditional finance and creating entirely new asset classes, few predicted that digital gold tokens would become one of the most traded instruments on exchanges like BTCC. Yet 2025 data reveals a striking pattern that challenges common assumptions about what crypto users actually want when markets turn uncertain. BTCC exchange crossing $5.72 billion in tokenized gold trading volume represents more than a statistical milestone. The 809% volume increase from the first quarter […]

Ver mais

Like 0

Liked Liked

technocracy

Phase 2 Calibration: Fixing Gating and Reward Scoring Together

digitado ⋅ 26 de March de 2026

I didn’t add per‑category OOD thresholds because it was academically elegant. I added them because my baseline runs were telling me the same story over and over: some prompt categories were systematically getting mis-gated by a single global uncertainty threshold. When that happens, you don’t just waste compute—you route the wrong jobs into the wrong generation strategy, and your downstream scoring starts making “confident” decisions on top of the wrong substrate. Phase 2 calibration in this codebase is […]

Ver mais

Like 0

Liked Liked

technocracy

Thomas Metzinger: Being No One: The Self-Model Theory of Subjectivity

digitado ⋅ 28 de October de 2024

Thomas Metzinger argues that the self is an illusion—a virtual construct the brain generates to manage perception and action. When this self-model becomes transparent, we mistake it for reality and feel like “someone” inside our body. In truth, we’re self-simulating organisms, biological systems so advanced that we’ve come to believe our own virtual reflection is real.

Ver mais

Like 0

Liked Liked

technocracy

Approximating the Permanent of a Random Matrix with Polynomially Small Mean: Zeros and Universality

digitado ⋅ 3 de April de 2026

arXiv:2604.01367v1 Announce Type: new Abstract: We study algorithms for approximating the permanent of a random matrix when the entries are slightly biased away from zero. This question is motivated by the goal of understanding the classical complexity of linear optics and emph{boson sampling} (Aaronson and Arkhipov ’11; Eldar and Mehraban ’17). Barvinok’s interpolation method enables efficient approximation of the permanent, provided one can establish a sufficiently large zero-free region for the polynomial $mathrm{per}(zJ + W)$, where $J$ is […]

Ver mais

Like 0

Liked Liked

technocracy

SQaLe: A Large Text-to-SQL Corpus Grounded in Real Schemas

digitado ⋅ 27 de February de 2026

arXiv:2602.22223v1 Announce Type: new Abstract: Advances in large language models have accelerated progress in text-to-SQL, methods for converting natural language queries into valid SQL queries. A key bottleneck for developing generalizable text-to-SQL models is the lack of large-scale datasets with sufficient schema and query complexity, domain coverage, and task diversity. We introduce SQaLe: a large-scale semi-synthetic text-to-SQL dataset built on 135,875 relational database schemas expanded from a collection of real-world schemas, SchemaPile. We establish a principled generation pipeline […]

Ver mais

Like 0

Liked Liked

technocracy

Stanford Researchers Release OpenJarvis: A Local-First Framework for Building On-Device Personal AI Agents with Tools, Memory, and Learning

digitado ⋅ 13 de March de 2026

Stanford researchers have introduced OpenJarvis, an open-source framework for building personal AI agents that run entirely on-device. The project comes from Stanford’s Scaling Intelligence Lab and is presented as both a research platform and deployment-ready infrastructure for local-first AI systems. Its focus is not only model execution, but also the broader software stack required to make on-device agents usable, measurable, and adaptable over time. Why OpenJarvis? According to the Stanford research team, most current personal AI projects still […]

Ver mais

Like 0

Liked Liked

technocracy

SAGE: Tool-Augmented LLM Task Solving Strategies in Scalable Multi-Agent Environments

digitado ⋅ 16 de January de 2026

arXiv:2601.09750v1 Announce Type: new Abstract: Large language models (LLMs) have proven to work well in question-answering scenarios, but real-world applications often require access to tools for live information or actuation. For this, LLMs can be extended with tools, which are often defined in advance, also allowing for some fine-tuning for specific use cases. However, rapidly evolving software landscapes and individual services require the constant development and integration of new tools. Domain- or company-specific tools can greatly elevate the […]

Ver mais

Like 0

Liked Liked