digitado

Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos

digitado ⋅ 2 March 2026

arXiv:2602.23543v1 Announce Type: new Abstract: We introduce Synthetic Visual Genome 2 (SVG2), a large-scale panoptic video scene graph dataset. SVG2 contains over 636K videos with 6.6M objects, 52.0M attributes, and 6.7M relations, providing an order-of-magnitude increase in scale and diversity over prior spatio-temporal scene graph datasets. To create SVG2, we design a fully automated pipeline that combines multi-scale panoptic segmentation, online-offline trajectory tracking with automatic new-object discovery, per-trajectory semantic parsing, and GPT-5-based spatio-temporal relation inference. Building on this […]

technocracy

Riemannian Lyapunov Optimizer: A Unified Framework for Optimization

digitado ⋅ 2 February 2026

arXiv:2601.22284v1 Announce Type: new Abstract: We introduce Riemannian Lyapunov Optimizers (RLOs), a family of optimization algorithms that unifies classic optimizers within one geometric framework. Unlike heuristic improvements to existing optimizers, RLOs are systematically derived from a novel control-theoretic framework that reinterprets optimization as an extended state discrete-time controlled dynamical system on a Riemannian parameter manifold. Central to this framework is the identification of a Normally Attracting Invariant Manifold (NAIM), which organizes training dynamics into two distinct stages: rapid […]

technocracy

Adaptive Machine Learning Framework for QoT Approximation Using Link-Level Embeddings

digitado ⋅ 13 March 2026

The management of today’s optical networks depends heavily on accurate estimation of Quality of Transmission (QoT). The current analytical approach requires exact physical values, which are often not available, resulting in inefficient management of the network. This paper proposes an Adaptive Machine Learning Framework that aims to address the analytical approach’s limitations using a new data-driven approach. The proposed framework combines link-level embeddings with an Artificial Neural Network (ANN) to process the unique sequence […]

technocracy

Authentication Customer Segmentation — BEACON: K-Means Clustering

digitado ⋅ 13 February 2026

What is BEACON? BEACON (Behavioral Evaluation for Authentication Cohorts & Outcomes Network) is a data-driven framework that segments customers based on their authentication behavior using machine learning (k-means clustering). It groups users along interpretable dimensions of engagement and risk to better understand login behavior, fraud patterns, and retention outcomes. How do customers engage with the product/system? Customers engage with BEACON indirectly through their authentication behaviors — such as login frequency, success rates, friction encountered (e.g., challenge verification), and security-related interactions. Their […]
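
As a rough illustration of the k-means segmentation described in this excerpt, the sketch below clusters a handful of made-up customers on three behavioral features named in the text: login frequency, success rate, and challenge (friction) rate. The data, the function names, and the farthest-first initialization are assumptions chosen to keep the example deterministic; none of them come from BEACON itself.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two feature tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def farthest_first_centers(points, k):
    """Deterministic initialization (an assumption, not BEACON's method):
    start from the first point, then repeatedly add the point that is
    farthest from every center chosen so far."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(
            max(points, key=lambda p: min(squared_distance(p, c) for c in centers))
        )
    return centers


def k_means(points, k, max_iters=100):
    """Lloyd's algorithm: alternate assignment and centroid update
    until the labels stop changing."""
    centers = farthest_first_centers(points, k)
    labels = None
    for _ in range(max_iters):
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centers[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centroids are too
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old center if a cluster is empty
                centers[j] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels, centers


# Hypothetical customers: (logins per week, success rate, challenge rate).
customers = [
    (20, 0.98, 0.02), (18, 0.97, 0.03), (22, 0.99, 0.01),  # frequent, low friction
    (3, 0.60, 0.40), (2, 0.55, 0.50), (4, 0.65, 0.45),     # rare, high friction
]
labels, centers = k_means(customers, k=2)
```

In this toy data the login count dominates the distance because the features are not rescaled; a real segmentation would normally standardize each feature first so that rates and counts contribute comparably.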

technocracy

From San Francisco to the Sands: Why U.S. Tech Talent Is Eyeing the UAE

digitado ⋅ 28 February 2026

If you hang around startup circles in the Bay Area long enough, you’ll start hearing something unexpected between funding rounds and AI debates: founders quietly Googling “car rental in UAE” and checking flight prices to Dubai and Abu Dhabi. What started as curiosity has turned into a real trend. From San Francisco to the sands of the Arabian Peninsula, American tech talent is seriously eyeing the United Arab Emirates—and not just for a quick conference or a flashy […]

technocracy

Reports of ad-supported Xbox game streams show Microsoft’s lack of imagination

digitado ⋅ 19 January 2026

Currently, Microsoft’s long-running Cloud Gaming service is limited to players who have a Game Pass subscription. Now, new reporting suggests Microsoft is planning to offer non-subscribers access to game streams paid for by advertising in the near future, but only in extremely limited circumstances. The latest wave of rumors was set off late last week when The Verge’s Tom Warren shared an Xbox Cloud Gaming loading screen with a message mentioning “1 hour of ad supported playtime […]

technocracy

A Scalable Measure of Loss Landscape Curvature for Analyzing the Training Dynamics of LLMs

digitado ⋅ 26 January 2026

arXiv:2601.16979v1 Announce Type: cross Abstract: Understanding the curvature evolution of the loss landscape is fundamental to analyzing the training dynamics of neural networks. The most commonly studied measure, Hessian sharpness ($\lambda_{\max}^H$), the largest eigenvalue of the loss Hessian, determines local training stability and interacts with the learning rate throughout training. Despite its significance in analyzing training dynamics, direct measurement of Hessian sharpness remains prohibitive for Large Language Models (LLMs) due to high computational cost. We analyze […]
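
For readers unfamiliar with Hessian sharpness, the sketch below shows the standard direct way to estimate it: power iteration on Hessian-vector products. This is the kind of direct measurement the abstract calls costly at LLM scale, not the paper's proposed measure, which the truncated excerpt does not describe. The function names and the toy quadratic loss are assumptions for illustration.

```python
import math
import random


def power_iteration_sharpness(hvp, dim, iters=200, seed=0):
    """Estimate the largest-magnitude Hessian eigenvalue.

    hvp is a function v -> H v (a Hessian-vector product); the Hessian
    itself is never formed. Note that power iteration converges to the
    eigenvalue of largest absolute value, which equals the largest
    eigenvalue only when that one dominates in magnitude.
    """
    rng = random.Random(seed)
    v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    estimate = 0.0
    for _ in range(iters):
        hv = hvp(v)
        # Rayleigh quotient v^T H v for the current unit vector v.
        estimate = sum(a * b for a, b in zip(v, hv))
        norm = math.sqrt(sum(x * x for x in hv))
        if norm == 0.0:
            break  # v lies in the null space; nothing more to learn
        v = [x / norm for x in hv]
    return estimate


def toy_hvp(v):
    """Hessian-vector product for the toy loss
    L(w) = 0.5 * (4 w0^2 + 1 w1^2 + 0.5 w2^2), whose Hessian is
    diag(4, 1, 0.5)."""
    diag = [4.0, 1.0, 0.5]
    return [d * x for d, x in zip(diag, v)]


sharpness = power_iteration_sharpness(toy_hvp, dim=3)
```

Each iteration costs one Hessian-vector product, which for a neural network is roughly an extra backward pass over the batch; repeating that many times per measurement is what makes this approach expensive for large models.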

technocracy

LLM-Based Agentic Systems for Software Engineering: Challenges and Opportunities

digitado ⋅ 16 January 2026

arXiv:2601.09822v1 Announce Type: new Abstract: Despite recent advancements in Large Language Models (LLMs), complex Software Engineering (SE) tasks require more collaborative and specialized approaches. This concept paper systematically reviews the emerging paradigm of LLM-based multi-agent systems, examining their applications across the Software Development Life Cycle (SDLC), from requirements engineering and code generation to static code checking, testing, and debugging. We delve into a wide range of topics such as language model selection, SE evaluation benchmarks, state-of-the-art agentic frameworks […]

technocracy

ProFocus: Proactive Perception and Focused Reasoning in Vision-and-Language Navigation

digitado ⋅ 9 March 2026

arXiv:2603.05530v1 Announce Type: new Abstract: Vision-and-Language Navigation (VLN) requires agents to accurately perceive complex visual environments and reason over navigation instructions and histories. However, existing methods passively process redundant visual inputs and treat all historical contexts indiscriminately, resulting in inefficient perception and unfocused reasoning. To address these challenges, we propose ProFocus, a training-free progressive framework that unifies Proactive Perception and Focused Reasoning through collaboration between large language models (LLMs) and vision-language models (VLMs). For proactive perception, ProFocus transforms […]

technocracy

Learning Deterministic Finite-State Machines from the Prefixes of a Single String is NP-Complete

digitado ⋅ 19 January 2026

It is well known that computing a minimum DFA consistent with a given set of positive and negative examples is NP-hard. Previous work has identified conditions on the input sample under which the problem becomes tractable or remains hard. In this paper, we study the computational complexity of the case where the input sample is prefix-closed. This formulation is equivalent to computing a minimum Moore machine consistent with observations along its runs. We show that the problem is […]
