digitado

Self-Aware Knowledge Probing: Evaluating Language Models’ Relational Knowledge through Confidence Calibration

digitado ⋅ 28 January 2026

arXiv:2601.18901v1 (announce type: new). Abstract: Knowledge probing quantifies how much relational knowledge a language model (LM) has acquired during pre-training. Existing knowledge probes evaluate model capabilities through metrics like prediction accuracy and precision. Such evaluations fail to account for the model’s reliability, reflected in the calibration of its confidence scores. In this paper, we propose a novel calibration probing framework for relational knowledge, covering three modalities of model confidence: (1) intrinsic confidence, (2) structural consistency and (3) semantic […]

technocracy

TeleWorld: Towards Dynamic Multimodal Synthesis with a 4D World Model

digitado ⋅ 6 January 2026

arXiv:2601.00051v1 (announce type: new). Abstract: World models aim to endow AI systems with the ability to represent, generate, and interact with dynamic environments in a coherent and temporally consistent manner. While recent video generation models have demonstrated impressive visual quality, they remain limited in real-time interaction, long-horizon consistency, and persistent memory of dynamic scenes, hindering their evolution into practical world models. In this report, we present TeleWorld, a real-time multimodal 4D world modeling framework that unifies video generation, […]

Dataset Distillation as Pushforward Optimal Quantization

digitado ⋅ 9 February 2026

arXiv:2501.07681v3 (announce type: replace-cross). Abstract: Dataset distillation aims to find a synthetic training set such that training on the synthetic data achieves similar performance to training on real data, with orders of magnitude lower computational requirements. Existing methods can be broadly categorized as either bi-level optimization problems that have neural network training heuristics as the lower-level problem, or disentangled methods that bypass the bi-level optimization by matching distributions of data. The latter method has the major advantages […]

Warner Bros. sticks with Netflix merger, calls Paramount’s $108B bid “illusory”

digitado ⋅ 7 January 2026

The Warner Bros. Discovery board has unanimously voted to rebuff Paramount’s $108.4 billion offer and urged shareholders to reject the hostile takeover bid. The board is continuing to support Netflix’s pending $82.7 billion purchase of its streaming and movie studios businesses along with a separate spinoff of the Warner Bros. cable TV division. Warner Bros. called the Paramount bid “illusory” in a presentation for shareholders today, saying the offer requires an “extraordinary amount of debt financing” and other […]

Learning to Compose for Cross-domain Agentic Workflow Generation

digitado ⋅ 11 February 2026

Automatically generating agentic workflows (executable operator graphs or code that orchestrate reasoning, verification, and repair) has become a practical way to solve complex tasks beyond what single-pass LLM generation can reliably handle. Yet what constitutes a good workflow depends heavily on the task distribution and the available operators. Under domain shift, current systems typically rely on iterative workflow refinement to discover a feasible workflow from a large workflow space, incurring high iteration costs and yielding unstable, […]

Unsupervised Video Class-Incremental Learning via Deep Embedded Clustering Management

digitado ⋅ 20 January 2026

Unsupervised video class-incremental learning (uVCIL) is an important paradigm for learning video information without forgetting and without using any data labels. Prior approaches have focused on supervised class-incremental learning, which relies on knowledge of labels and task boundaries; this is costly, requires human annotation, or is simply not a realistic option. In this paper, we propose a simple yet effective approach to address uVCIL. We first consider a deep feature extractor network, providing a […]

A Hybrid Quantum-Classical Machine Learning Framework for Early and Accurate Diagnosis of Chronic Diseases

digitado ⋅ 7 January 2026

Chronic diseases—such as diabetes, cardiovascular disorders, and chronic respiratory conditions—account for over 70% of global deaths annually, with late diagnosis being a primary contributor to poor outcomes. While machine learning (ML) models have shown promise in early detection, they often suffer from limited generalizability, data heterogeneity, and insufficient interpretability in clinical settings. This paper introduces a novel hybrid quantum-classical machine learning (HQML) framework that synergistically combines the pattern recognition power of classical deep neural networks with the high-dimensional […]

Continuum Robot Localization using Distributed Time-of-Flight Sensors

digitado ⋅ 10 February 2026

arXiv:2602.07209v1 (announce type: new). Abstract: Localization and mapping of an environment are crucial tasks for any robot operating in unstructured environments. Time-of-flight (ToF) sensors (e.g., lidar) have proven useful in mobile robotics, where high-resolution sensors can be used for simultaneous localization and mapping. In soft and continuum robotics, however, these high-resolution sensors are too large for practical use. This, combined with the deformable nature of such robots, has resulted in continuum robot (CR) localization and mapping in unstructured environments […]

Neural Architecture Search

digitado ⋅ 6 August 2020

Although most popular and successful model architectures are designed by human experts, that doesn’t mean we have explored the entire network architecture space and settled on the best option. We would have a better chance of finding the optimal solution if we adopted a systematic, automatic way of learning high-performance model architectures.

Comparative Evaluation of Deep Learning-Based and WHO-Informed Approaches for Sperm Morphology Assessment

digitado ⋅ 15 January 2026

Assessment of sperm morphological quality remains a critical yet subjective component of male fertility evaluation, often limited by inter-observer variability and resource constraints. This study presents a comparative biomedical artificial intelligence framework evaluating an image-based deep learning model (HuSHeM) alongside a clinically grounded baseline derived from World Health Organization criteria augmented with the Systemic Inflammation Response Index (WHO(+SIRI)). The HuSHeM model was trained on high-resolution sperm morphology images and evaluated using an independent clinical cohort. Model performance was […]
