digitado

Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning

digitado ⋅ May 6, 2026

We study the distribution of regret in stochastic multi-armed bandits and episodic reinforcement learning through a unified framework. We formalize a distributional regret bound as a probabilistic guarantee that holds uniformly over all confidence levels $\delta \in (0,1]$, thereby characterizing the regret distribution across the full range of $\delta$. We present a simple UCBVI-style algorithm with exploration bonus $\min\{c_{1,k}/N,\ c_{2,k}/\sqrt{N}\}$, where $N$ denotes the visit count and $(c_{1,k}, c_{2,k})$ are user-specified parameters. For arbitrary parameter sequences, we derive general gap-independent […]
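As a rough illustration of the exploration bonus described in this abstract, the minimum of a $1/N$ term and a $1/\sqrt{N}$ term can be sketched as below. The function name `exploration_bonus` and the choice to return infinity for an unvisited state-action pair are assumptions made for this sketch, not details taken from the paper.

```python
import math


def exploration_bonus(visit_count: int, c1: float, c2: float) -> float:
    """Return min(c1 / N, c2 / sqrt(N)) for visit count N.

    c1 and c2 stand in for the user-specified parameters (c_{1,k}, c_{2,k})
    at a fixed episode k. Returning infinity for N = 0 is an assumption of
    this sketch so that unvisited pairs are always tried first.
    """
    if visit_count <= 0:
        return math.inf
    return min(c1 / visit_count, c2 / math.sqrt(visit_count))


# With c1 = 8 and c2 = 1 the sqrt term is smaller until N reaches (c1 / c2)^2 = 64,
# after which the faster-decaying c1 / N term takes over.
small_n = exploration_bonus(4, c1=8.0, c2=1.0)    # min(2.0, 0.5)
large_n = exploration_bonus(256, c1=8.0, c2=1.0)  # min(0.03125, 0.0625)
```

The crossover at $N = (c_{1,k}/c_{2,k})^2$ follows directly from comparing the two terms.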


technocracy

Feedback on my EU AI Act Risk Tier Assessor [P]

digitado ⋅ June 1, 2026

Hey everyone, hope this is ok to post here. I built a free EU AI Act risk assessment tool and would love some feedback from people who actually know this space. You fill out a 10-question form describing your AI system, it classifies your EU AI Act risk tier, and emails you a PDF report with your applicable Articles and priority actions. Takes about 2 minutes, no account required. https://assessment.aiella.com Eventually I want to build a monitoring SDK […]


technocracy

Improve operational visibility for inference workloads on Amazon Bedrock with new CloudWatch metrics for TTFT and Estimated Quota Consumption

digitado ⋅ March 12, 2026

As organizations scale their generative AI workloads on Amazon Bedrock, operational visibility into inference performance and resource consumption becomes critical. Teams running latency-sensitive applications must understand how quickly models begin generating responses. Teams managing high-throughput workloads must understand how their requests consume quota so they can avoid unexpected throttling. Until now, gaining this visibility required custom client-side instrumentation or reactive troubleshooting after issues occurred. Today, we’re announcing two new Amazon CloudWatch metrics for Amazon Bedrock, TimeToFirstToken and EstimatedTPMQuotaUsage. […]


technocracy

AI Detectors Fail Diverse Student Populations: A Mathematical Framing of Structural Detection Limits

digitado ⋅ March 24, 2026

arXiv:2603.20254v1. Abstract: Student experiences and empirical studies report that “black box” AI text detectors produce high false positive rates with disproportionate errors against certain student populations, yet theoretical analyses typically model detection as a test between two known distributions for human and AI prose. This framing omits the structural feature of university assessment whereby an assessor generally does not know the individual student’s writing distribution, making the null hypothesis composite. Standard application of the variational […]


technocracy

Inside the Forward Pass: Pre-Fill, Decode, and the GPU Economics of Serving Large Language Models

digitado ⋅ February 17, 2026

Author(s): Utkarsh Mittal. Originally published on Towards AI. Why Inference Is the Endgame: Pre-training a frontier large language model typically consumes somewhere between 15 trillion and 30 trillion tokens. That sounds like an enormous number, until you do the arithmetic on the inference side. There are roughly 7 to 8 billion people on the planet. If each person sent just one query per day to a model like ChatGPT, and each query consumed approximately 2,000 tokens when […]
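The arithmetic this excerpt sets up can be worked through in a few lines. This is a minimal sketch using only the figures quoted above (7 to 8 billion people, one query per person per day, roughly 2,000 tokens per query). The excerpt is truncated before it qualifies the 2,000-token figure, so the per-query number is treated here as a flat assumption, and the function name is hypothetical.

```python
def daily_inference_tokens(people: int, queries_per_person: int, tokens_per_query: int) -> int:
    """Total tokens processed per day under the excerpt's thought experiment."""
    return people * queries_per_person * tokens_per_query


TRILLION = 10**12

# Figures quoted in the excerpt; 2,000 tokens per query is taken at face value.
low = daily_inference_tokens(7_000_000_000, 1, 2_000)
high = daily_inference_tokens(8_000_000_000, 1, 2_000)

print(f"Daily inference volume: {low / TRILLION:.0f} to {high / TRILLION:.0f} trillion tokens")
print("Quoted pre-training budget: 15 to 30 trillion tokens")
```

Under these assumptions, a single day of inference lands in the same range as the low end of the quoted pre-training token budget.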


technocracy

A Cautionary Tale of Self-Supervised Learning for Imaging Biomarkers: Alzheimer’s Disease Case Study

digitado ⋅ January 23, 2026

Discovery of sensitive and biologically grounded biomarkers is essential for early detection and monitoring of Alzheimer’s disease (AD). Structural MRI is widely available but typically relies on hand-crafted features such as cortical thickness or volume. We ask whether self-supervised learning (SSL) can uncover more powerful biomarkers from the same data. Existing SSL methods underperform FreeSurfer-derived features in disease classification, conversion prediction, and amyloid status prediction. We introduce Residual Noise Contrastive Estimation (R-NCE), a new SSL framework that integrates […]


technocracy

Probably Approximately Consensus: On the Learning Theory of Finding Common Ground

digitado ⋅ April 23, 2026

A primary goal of online deliberation platforms is to identify ideas that are broadly agreeable to a community of users through their expressed preferences. Yet, consensus elicitation should ideally extend beyond the specific statements provided by users and should incorporate the relative salience of particular topics. We address this issue by modelling consensus as an interval in a one-dimensional opinion space derived from potentially high-dimensional data via embedding and dimensionality reduction. We define an objective that maximizes expected […]


technocracy

BLEG: LLM Functions as Powerful fMRI Graph-Enhancer for Brain Network Analysis

digitado ⋅ April 10, 2026

arXiv:2604.07361v1. Abstract: Graph Neural Networks (GNNs) have been widely used in diverse brain network analysis tasks based on preprocessed functional magnetic resonance imaging (fMRI) data. However, their performance is constrained by high feature sparsity and the inherent limitations of domain knowledge within uni-modal neurographs. Meanwhile, large language models (LLMs) have demonstrated powerful representation capabilities. Combining LLMs with GNNs presents a promising direction for brain network analysis. While LLMs and MLLMs have emerged in neuroscience, integration […]


technocracy

How to Fine-tune Vision Transformers Using PEFT for Video Classification?

digitado ⋅ January 16, 2026

Vision Transformers (ViTs) and their variants, such as TimeSformer and ViViT, pretrained on video understanding tasks, have set new benchmarks in computer vision. However, their enormous size (hundreds of millions or billions of parameters) makes full fine-tuning very expensive, requiring large amounts of VRAM and training time. Parameter-Efficient Fine-Tuning (PEFT) techniques allow us to adapt these massive models to new tasks by training […]


technocracy

Is Your “Human-in-the-Loop” Actually Slowing You Down? Here’s What We Learned

digitado ⋅ January 30, 2026

In the rush to adopt AI and automation, many teams implement human-in-the-loop (HITL) frameworks, believing that involving a person in the process solves problems of reliability, quality, and trust. But as we’ve learned from real engineering workflows and integrations, the story isn’t that simple. In some contexts, a human in the loop does improve outcomes, but in others, they can unintentionally become a bottleneck that limits speed, scalability, and innovation. In this post, we’ll analyze when human-in-the-loop is truly valuable, when […]
