KBVQ-MoE: KLT-guided SVD with Bias-Corrected Vector Quantization for MoE Large Language Models
arXiv:2602.11184v1 Announce Type: new
Abstract: Mixture of Experts (MoE) models have achieved great success by significantly improving performance while maintaining computational efficiency through sparse expert activation. However, their enormous parameter counts and memory demands pose major challenges for deployment in resource-constrained environments. Vector Quantization (VQ) offers a promising approach to ultra-low-bit compression of Large Language Models (LLMs) by leveraging a codebook in which weight vectors are mapped to their most similar discrete codewords. Yet, directly applying VQ to MoEs […]
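For context, the codebook-based mapping the abstract describes can be sketched as below. This is a minimal illustration of generic vector quantization of weight vectors, not the paper's KBVQ-MoE procedure; the codebook size, vector dimension, and random-sample codebook initialization are illustrative assumptions (a practical VQ compressor would learn the codebook, e.g. via k-means).

```python
import numpy as np

def vector_quantize(weights: np.ndarray, codebook: np.ndarray):
    """Map each weight (sub-)vector to its most similar codeword.

    weights:  (n, d) array of weight vectors (e.g. rows of a weight matrix
              reshaped into d-dimensional groups).
    codebook: (k, d) array of k discrete codewords.
    Returns the codeword indices (what is actually stored) and the
    dequantized reconstruction codebook[idx].
    """
    # Squared Euclidean distance from every weight vector to every codeword.
    dists = ((weights[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
    idx = dists.argmin(axis=1)        # index of the nearest codeword
    return idx, codebook[idx]         # reconstruction by codebook lookup

# Toy usage: quantize a 256x8 block of "weights" with a 16-entry codebook,
# so each 8-dim vector is stored as a 4-bit index (0.5 bits per weight).
rng = np.random.default_rng(0)
W = rng.normal(size=(256, 8)).astype(np.float32)
codebook = W[rng.choice(len(W), size=16, replace=False)]  # naive init; real VQ learns this
idx, W_hat = vector_quantize(W, codebook)
print("bits per weight:", np.log2(len(codebook)) / W.shape[1])
print("reconstruction MSE:", float(((W - W_hat) ** 2).mean()))
```

The storage saving comes from keeping only the small codebook plus one integer index per weight vector; the quality of the reconstruction then hinges on how well the codebook matches the weight distribution, which is where the truncated abstract turns to the difficulties of applying VQ directly to MoE experts.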