digitado

Synergizing Understanding and Generation with Interleaved Analyzing-Drafting Thinking

digitado ⋅ 26 February 2026

arXiv:2602.21435v1 (new). Abstract: Unified Vision-Language Models (UVLMs) aim to advance multimodal learning by supporting both understanding and generation within a single framework. However, existing approaches largely focus on architectural unification while overlooking the need for explicit interaction between the two capabilities during task solving. As a result, current models treat understanding and generation as parallel skills rather than synergistic processes. To achieve real synergy, we introduce the interleaved Analyzing-Drafting problem-solving loop (AD-Loop), a new thinking paradigm […]

technocracy

CellFluxRL: Biologically-Constrained Virtual Cell Modeling via Reinforcement Learning

digitado ⋅ 23 March 2026

Building virtual cells with generative models to simulate cellular behavior in silico is emerging as a promising paradigm for accelerating drug discovery. However, prior image-based generative approaches can produce implausible cell images that violate basic physical and biological constraints. To address this, we propose to post-train virtual cell models with reinforcement learning (RL), leveraging biologically meaningful evaluators as reward functions. We design seven rewards spanning three categories (biological function, structural validity, and morphological correctness) and optimize the state-of-the-art CellFlux model […]

technocracy

Q&A: MIT SHASS and the future of education in the age of AI

digitado ⋅ 14 April 2026

The MIT School of Humanities, Arts, and Social Sciences (SHASS) was founded in 1950 in response to “a new era emerging from social upheaval and the disasters of war,” as outlined in the 1949 Lewis Committee Report. The report’s findings emphasized MIT’s role and responsibility in the new nuclear age, which called for doubling down on genuine “integration” of scientific and technical topics with humanistic scholarship and teaching. Only that way, the committee wrote, could MIT tackle “the most […]

technocracy

Learning Hybrid-Control Policies for High-Precision In-Contact Manipulation Under Uncertainty

digitado ⋅ 21 April 2026

Reinforcement learning-based control policies have been frequently demonstrated to be more effective than analytical techniques for many manipulation tasks. Commonly, these methods learn neural control policies that predict end-effector pose changes directly from observed state information. For tasks like inserting delicate connectors which induce force constraints, pose-based policies have limited explicit control over force and rely on carefully tuned low-level controllers to avoid executing damaging actions. In this work, we present hybrid position-force control policies that learn to […]

technocracy

Introducing Nova Forge SDK, a seamless way to customize Nova models for enterprise AI

digitado ⋅ 18 March 2026

Large language models (LLMs) have transformed how we interact with AI, but one size doesn’t fit all. Out-of-the-box LLMs are trained with broad, general knowledge and optimized for a wide range of use cases, but they often fall short when it comes to domain-specific tasks, proprietary workflows, or unique business requirements. Enterprise customers increasingly need specialized LLMs that deeply understand their proprietary data, business processes, and domain-specific terminology. Without customization, you’re forced to choose between accepting generic […]

technocracy

LEGATO: Good Identity Unlearning Is Continuous

digitado ⋅ 9 January 2026

arXiv:2601.04282v1 (new). Abstract: Machine unlearning plays a crucial role in enabling generative models trained on large datasets to remove sensitive, private, or copyright-protected data. However, existing machine unlearning methods face three challenges in making generative models forget an identity: 1) inefficiency, where identity erasure requires fine-tuning all the model’s parameters; 2) limited controllability, where forgetting intensity cannot be controlled and explainability is lacking; 3) catastrophic collapse, where the model’s retention capability undergoes drastic degradation […]

technocracy

ChatGPT levels up with Health

digitado ⋅ 8 January 2026

Good morning, AI enthusiasts. Over 40M people already ask ChatGPT medical questions every day, and now OpenAI is making those conversations even more personal. A new ChatGPT Health experience pulls in medical records and fitness data for tailored advice, landing right as AI-driven diagnostics, prescriptions, and FDA-approved devices are set to usher in a completely new era of personalized care. In today’s AI rundown: OpenAI’s dedicated […]

technocracy

LOCUS: Low-Dimensional Model Embeddings for Efficient Model Exploration, Comparison, and Selection

digitado ⋅ 30 January 2026

arXiv:2601.21082v1 (new). Abstract: The rapidly growing ecosystem of Large Language Models (LLMs) makes it increasingly challenging to manage and utilize the vast and dynamic pool of models effectively. We propose LOCUS, a method that produces low-dimensional vector embeddings that compactly represent a language model’s capabilities across queries. LOCUS is an attention-based approach that generates embeddings by a deterministic forward pass over query encodings and evaluation scores via an encoder model, enabling seamless incorporation of new models […]

technocracy

Preventing Learning Stagnation in PPO by Scaling to 1 Million Parallel Environments

digitado ⋅ 6 March 2026

Plateaus, where an agent’s performance stagnates at a suboptimal level, are a common problem in deep on-policy RL. Focusing on PPO due to its widespread adoption, we show that plateaus in certain regimes arise not because of known exploration, capacity, or optimization challenges, but because sample-based estimates of the loss eventually become poor proxies for the true objective over the course of training. As a recap, PPO switches between sampling rollouts from several parallel environments online using the […]

technocracy

Learning functional components of PDEs from data using neural networks

digitado ⋅ 13 February 2026

Partial differential equations often contain unknown functions that are difficult or impossible to measure directly, hampering our ability to derive predictions from the model. Workflows for recovering scalar PDE parameters from data are well studied: here we show how similar workflows can be used to recover functions from data. Specifically, we embed neural networks into the PDE and show how, as they are trained on data, they can approximate unknown functions with arbitrary accuracy. Using nonlocal aggregation-diffusion equations […]
