digitado

Transferable Backdoor Attacks for Code Models via Sharpness-Aware Adversarial Perturbation

digitado ⋅ 13 February 2026

arXiv:2602.11213v1 Announce Type: new Abstract: Code models are increasingly adopted in software development but remain vulnerable to backdoor attacks via poisoned training data. Existing backdoor attacks on code models face a fundamental trade-off between transferability and stealthiness. Static trigger-based attacks insert fixed dead code patterns that transfer well across models and datasets but are easily detected by code-specific defenses. In contrast, dynamic trigger-based attacks adaptively generate context-aware triggers to evade detection but suffer from poor cross-dataset transferability. Moreover, […]


technocracy

Topological Relational Theory: A Simplicial-Complex View of Functional Dependencies, Lossless Decomposition, and Acyclicity

digitado ⋅ 26 February 2026

arXiv:2602.21213v1 Announce Type: new Abstract: We develop a topological lens on relational schema design by encoding functional dependencies (FDs) as simplices of an abstract simplicial complex. This dependency complex exposes multi-attribute interactions and enables homological invariants (Betti numbers) to diagnose cyclic dependency structure. We define Simplicial Normal Form (SNF) as homological acyclicity of the dependency complex in positive dimensions, i.e., vanishing reduced homology for all $n \ge 1$. SNF is intentionally weaker than contractibility and does not identify […]


technocracy

Automated Optimization Modeling via a Localizable Error-Driven Perspective

digitado ⋅ 13 February 2026

arXiv:2602.11164v1 Announce Type: new Abstract: Automated optimization modeling via Large Language Models (LLMs) has emerged as a promising approach to assist complex human decision-making. While post-training has become a pivotal technique to enhance LLMs’ capabilities in this domain, its effectiveness is severely constrained by the scarcity and underutilization of high-quality training data. However, through a detailed profiling of error patterns across various problem-response pairs drawn from post-training, we identify two fundamental limitations of existing automated optimization modeling approaches: […]


technocracy

Watch Wider and Think Deeper: Collaborative Cross-modal Chain-of-Thought for Complex Visual Reasoning

digitado ⋅ 7 January 2026

arXiv:2601.02422v1 Announce Type: new Abstract: Multi-modal reasoning requires the seamless integration of visual and linguistic cues, yet existing Chain-of-Thought methods suffer from two critical limitations in cross-modal scenarios: (1) over-reliance on single coarse-grained image regions, and (2) semantic fragmentation between successive reasoning steps. To address these issues, we propose the CoCoT (Collaborative Cross-modal Thought) framework, built upon two key innovations: a) Dynamic Multi-Region Grounding to adaptively detect the most relevant image regions based on the question, and […]


technocracy

Graph-Anchored Knowledge Indexing for Retrieval-Augmented Generation

digitado ⋅ 26 January 2026

arXiv:2601.16462v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) has emerged as a dominant paradigm for mitigating hallucinations in Large Language Models (LLMs) by incorporating external knowledge. Nevertheless, effectively integrating and interpreting key evidence scattered across noisy documents remains a critical challenge for existing RAG systems. In this paper, we propose GraphAnchor, a novel Graph-Anchored Knowledge Indexing approach that reconceptualizes graph structures from static knowledge representations into active, evolving knowledge indices. GraphAnchor incrementally updates a graph during iterative retrieval […]


technocracy

Statistical Inference Leveraging Synthetic Data with Distribution-Free Guarantees

digitado ⋅ 19 February 2026

arXiv:2509.20345v2 Announce Type: replace-cross Abstract: The rapid proliferation of high-quality synthetic data — generated by advanced AI models or collected as auxiliary data from related tasks — presents both opportunities and challenges for statistical inference. This paper introduces a GEneral Synthetic-Powered Inference (GESPI) framework that wraps around any statistical inference procedure to safely enhance sample efficiency by combining synthetic and real data. Our framework leverages high-quality synthetic data to boost statistical power, yet adaptively defaults to the standard […]


technocracy

A Practical Guide to Multi-Agent Swarms and Automated Evaluation for Content Analysis

digitado ⋅ 26 February 2026

This guide builds a content analysis system using Strands multi-agent swarms (sentiment, toxicity, and analysis agents via Ollama Llama 3.1), plus automated evaluation with an LLM as judge that scores correctness and relevance.


technocracy

Fighting for the health of the planet with AI

digitado ⋅ 22 December 2025

For Priya Donti, childhood trips to India were more than an opportunity to visit extended family. The biennial journeys activated in her a motivation that continues to shape her research and her teaching. Contrasting her family home in Massachusetts, Donti — now the Silverman Family Career Development Professor in the MIT Department of Electrical Engineering and Computer Science (EECS) and a principal investigator at the MIT Laboratory for Information and Decision Systems — was struck by the disparities […]


technocracy

Learning to Explain: Supervised Token Attribution from Transformer Attention Patterns

digitado ⋅ 20 January 2026

Explainable AI (XAI) has become critical as transformer-based models are deployed in high-stakes applications including healthcare, legal systems, and financial services, where opacity hinders trust and accountability. Transformer self-attention mechanisms have proven valuable for model interpretability, with attention weights successfully used to understand model focus and behavior (Xu et al., 2015; Wiegreffe and Pinter, 2019). However, existing attention-based explanation methods rely on manually defined aggregation strategies and fixed attribution rules (Abnar and Zuidema, 2020a; Chefer et al., 2021), […]


technocracy

Learnings from COBOL modernization in the real world

digitado ⋅ 26 February 2026

There’s a lot of excitement right now about AI enabling mainframe application modernization. Boards are paying attention. CIOs are getting asked for a plan. AI is a genuine accelerator for COBOL modernization, but to get results, AI needs additional context that source code alone can’t provide. Here’s what we’ve learned working with 400+ enterprise customers: mainframe modernization has two very different halves. The first half is reverse engineering, understanding what your existing systems actually do. The second half is […]
