digitado

Expected information gain estimation via density approximations: Sample allocation and dimension reduction

digitado ⋅ 30 de January de 2026

arXiv:2411.08390v3 Announce Type: replace-cross Abstract: Computing expected information gain (EIG) from prior to posterior (equivalently, mutual information between candidate observations and model parameters or other quantities of interest) is a fundamental challenge in Bayesian optimal experimental design. We formulate flexible transport-based schemes for EIG estimation in general nonlinear/non-Gaussian settings, compatible with both standard and implicit Bayesian models. These schemes are representative of two-stage methods for estimating or bounding EIG using marginal and conditional density estimates. In this setting, […]

Ver mais

Like 0

Liked Liked

technocracy

Implementing AI algorithms in GPUs and CPUs // Intro to AI Accelerators! :)

digitado ⋅ 31 de October de 2019

Ok so today my professor decided to tell us a Halloween joke, and I thought it was the funniest thing ever: “What is the circumference of a jack-o-lantern divided by its diameter??” Pumpkin pi !!!!!!!!!!!! lol, i thought it was funny!!!!!!!!!!!! (its ok if you didn’t) 🙂 happy halloween lol i didn’t really dress up like anything because I barely even have time to fully get ready in the morning, but if “a student who is heading over […]

Ver mais

Like 0

Liked Liked

technocracy

Extreme Value Policy Optimization for Safe Reinforcement Learning

digitado ⋅ 17 de January de 2026

Ensuring safety is a critical challenge in applying Reinforcement Learning (RL) to real-world scenarios. Constrained Reinforcement Learning (CRL) addresses this by maximizing returns under predefined constraints, typically formulated as the expected cumulative cost. However, expectation-based constraints overlook rare but high-impact extreme value events in the tail distribution, such as black swan incidents, which can lead to severe constraint violations. To address this issue, we propose the Extreme Value policy Optimization (EVO) algorithm, leveraging Extreme Value Theory (EVT) to […]

Ver mais

Like 0

Liked Liked

technocracy

Accelerate custom LLM deployment: Fine-tune with Oumi and deploy to Amazon Bedrock

digitado ⋅ 10 de March de 2026

This post is cowritten by David Stewart and Matthew Persons from Oumi. Fine-tuning open source large language models (LLMs) often stalls between experimentation and production. Training configurations, artifact management, and scalable deployment each require different tools, creating friction when moving from rapid experimentation to secure, enterprise-grade environments. In this post, we show how to fine-tune a Llama model using Oumi on Amazon EC2 (with the option to create synthetic data using Oumi), store artifacts in Amazon S3, and […]

Ver mais

Like 0

Liked Liked

technocracy

When Backdoors Go Beyond Triggers: Semantic Drift in Diffusion Models Under Encoder Attacks

digitado ⋅ 25 de February de 2026

arXiv:2602.20193v1 Announce Type: new Abstract: Standard evaluations of backdoor attacks on text-to-image (T2I) models primarily measure trigger activation and visual fidelity. We challenge this paradigm, demonstrating that encoder-side poisoning induces persistent, trigger-free semantic corruption that fundamentally reshapes the representation manifold. We trace this vulnerability to a geometric mechanism: a Jacobian-based analysis reveals that backdoors act as low-rank, target-centered deformations that amplify local sensitivity, causing distortion to propagate coherently across semantic neighborhoods. To rigorously quantify this structural degradation, we […]

Ver mais

Like 0

Liked Liked

technocracy

Hybrid by Design: Inside the Mamba-MoE Engine of Nemotron 3

digitado ⋅ 1 de February de 2026

Inside the Mamba-MoE Engine of Nemotron 3 TL;DR The Models: The family includes Nano, Super, and Ultra.The Architecture: A Hybrid Mamba-Transformer Mixture-of-Experts (MoE) design that replaces most attention layers with Mamba-2 layers for high throughput. Key Innovations: LatentMoE: A new expert routing mechanism in Super/Ultra that projects tokens into a smaller latent space to improve accuracy-per-byte. MTP (Multi-Token Prediction): Enables faster generation via native speculative decoding. NVFP4: Native 4-bit floating-point training for the larger models. Capabilities: Supports 1M token context […]

Ver mais

Like 0

Liked Liked

technocracy

Perspectives and Issues in Machine Learning: 5 Issues You Must Know!

digitado ⋅ 7 de January de 2026

“Machine learning is not just about algorithms; it’s about learning from the data of life itself.” This powerful thought by Andrew Ng perfectly captures the essence of today’s machine learning revolution. The steadily growing influence of machine learning (ML) in our daily lives, from online shopping recommendations to medical diagnoses, brings along both immense opportunities and serious questions. So as to understand that better, in this blog, we will look at perspectives and issues in machine learning, explore […]

Ver mais

Like 0

Liked Liked

technocracy

Robotic Assembly Using Deep Reinforcement Learning

digitado ⋅ 21 de October de 2020

Introduction Disclaimer: This article is a cross post from Pytorch Medium Blog Post. One of the most exciting advancements, that has pushed the frontier of the Artificial Intelligence (AI) in recent years, is Deep Reinforcement Learning (DRL). DRL belongs to the family of machine learning algorithms. It assumes that intelligent machines can learn from their actions similar to the way humans learn from experience. Over the recent years we could witness some impressive real-world applications of DRL. The […]

Ver mais

Like 0

Liked Liked

technocracy

A new paradigm for global sensitivity analysis

digitado ⋅ 23 de March de 2026

arXiv:2409.06271v2 Announce Type: replace Abstract: It is well-known that Sobol indices, which count among the most popular sensitivity indices, are based on the Sobol decomposition. Here we challenge this construction by redefining Sobol indices without the Sobol decomposition. In fact, we show that Sobol indices are a particular instance of a more general concept which we call sensitivity measures. A sensitivity measure of a system taking inputs and returning outputs is a set function that is null at […]

Ver mais

Like 0

Liked Liked

technocracy

DUGC-VRNet: Joint VR Recognition and Channel Estimation for Spatially Non-Stationary XL-MIMO

digitado ⋅ 30 de March de 2026

arXiv:2603.25754v1 Announce Type: new Abstract: In this letter, we address spatially non-stationary near-field channel estimation for extremely large-scale multiple-input multiple-output (XL-MIMO) systems with a hybrid combining architecture. One key challenge in the considered problem lies in that conventional channel estimation algorithms typically struggle to effectively identify and adapt to the partial antenna visibility caused by varying visibility regions (VRs), thereby compromising estimation accuracy. To perform joint VR recognition and channel estimation, we integrate a deep unfolding network (DUN) […]

Ver mais

Like 0

Liked Liked