PPO from Scratch — A Self-Contained PyTorch Implementation Tested on Atari
submitted by /u/papers-100-lines [link] [comments]
Traditional mean–variance portfolio optimization proves inadequate for cryptocurrency markets, where extreme volatility, fat-tailed return distributions, and unstable correlation structures undermine the validity of variance as a comprehensive risk measure. To address these limitations, this paper proposes a unified entropy-based portfolio optimization framework grounded in the Maximum Entropy Principle (MaxEnt). Within this setting, Shannon entropy, Tsallis entropy, and Weighted Shannon Entropy (WSE) are formally derived as particular specifications of a common constrained optimization problem solved via the method of […]
arXiv:2601.03290v1 Announce Type: new Abstract: The deployment of transformer-based models on resource-constrained edge devices represents a critical challenge in enabling real-time artificial intelligence applications. This comprehensive survey examines lightweight transformer architectures specifically designed for edge deployment, analyzing recent advances in model compression, quantization, pruning, and knowledge distillation techniques. We systematically review prominent lightweight variants including MobileBERT, TinyBERT, DistilBERT, EfficientFormer, EdgeFormer, and MobileViT, providing detailed performance benchmarks on standard datasets such as GLUE, SQuAD, ImageNet-1K, and COCO. Our analysis […]
arXiv:2310.11143v5 Announce Type: replace Abstract: Accurate knowledge of indoor radon concentration is crucial for assessing radon-related health effects or identifying radon-prone areas. Indoor radon concentration at the national scale is usually estimated on the basis of extensive measurement campaigns. However, characteristics of the sampled households often differ from the characteristics of the target population owing to the large number of relevant factors that control the indoor radon concentration, such as the availability of geogenic radon or floor level. […]
From a weekend chore to a fun application of valuable operations research principles. The post "How I Optimized My Leaf Raking Strategy Using Linear Programming" appeared first on Towards Data Science.
Preliminaries: Gaussian distribution, log-likelihood, calculus, partial derivative, Lagrange multiplier.
EM Algorithm for Gaussian Mixture
Analysis
Maximum likelihood cannot be applied directly to the Gaussian mixture model because of the severe defects discussed in 'Maximum Likelihood of Gaussian Mixtures'. Inspired by K-means, a two-step algorithm was developed instead. The objective function is the log-likelihood function:
\[
\begin{aligned}
\ln \Pr(\mathbf{x}|\mathbf{\pi},\mathbf{\mu},\Sigma)&=\ln \left(\prod_{n=1}^{N}\sum_{j=1}^{K}\pi_j\mathcal{N}(\mathbf{x}_n|\mathbf{\mu}_j,\Sigma_j)\right)\\
&=\sum_{n=1}^{N}\ln \sum_{j=1}^{K}\pi_j\mathcal{N}(\mathbf{x}_n|\mathbf{\mu}_j,\Sigma_j)
\end{aligned}\tag{1}
\]
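As a rough illustration of the log-likelihood objective in equation (1), the sketch below evaluates it for a one-dimensional mixture using only the Python standard library. The restriction to scalar data with variances in place of covariance matrices, the function names `gaussian_log_pdf` and `gmm_log_likelihood`, and the requirement that all mixing weights be strictly positive are assumptions made for this example, not part of the original post.

```python
import math


def gaussian_log_pdf(x, mean, var):
    """Log density of a 1-D Gaussian N(x | mean, var)."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def gmm_log_likelihood(xs, weights, means, variances):
    """Sum over n of ln( sum over j of pi_j * N(x_n | mu_j, var_j) ).

    Assumes every weight is strictly positive (math.log(0) would raise).
    """
    total = 0.0
    for x in xs:
        # Per-component terms: ln pi_j + ln N(x | mu_j, var_j)
        terms = [
            math.log(w) + gaussian_log_pdf(x, m, v)
            for w, m, v in zip(weights, means, variances)
        ]
        # Log-sum-exp trick keeps the inner sum numerically stable
        peak = max(terms)
        total += peak + math.log(sum(math.exp(t - peak) for t in terms))
    return total


if __name__ == "__main__":
    data = [-1.2, 0.3, 4.8, 5.1]
    print(gmm_log_likelihood(data, [0.5, 0.5], [0.0, 5.0], [1.0, 1.0]))
```

The logarithm sits outside the inner sum over components, so the expression does not split into per-component terms; that coupling is why the parameters are not obtained in closed form and a two-step procedure is used.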
Nvidia unveiled Alpamayo at CES 2026, which includes a reasoning vision language action model that allows an autonomous vehicle to think more like a human and provide chain-of-thought reasoning.
The data intelligence company has just raised more than $4 billion in a Series L funding round at a $134 billion valuation — up 34% from the $100 billion valuation that it achieved just three months ago.
In my previous blog post, "How LLMs Work: A Beginner’s Guide to Decoder-Only Transformers", I explained in detail how large language models (LLMs) work, using analogies and maths. I received many requests for a simpler explanation of how a tokenizer is trained for such models, and how tokenization works. Hence, this blog post. A language model like GPT (which stands for Generative Pretrained Transformer) takes text, breaks it into tokens (words or subwords), converts those tokens into numbers, […]
Later this month, Volvo will unveil its new EX60 SUV. The Swedish automaker has adopted some of the latest trends in electric vehicle design for the EX60, like a structural battery pack and the use of very large castings. As always with automakers teasing a new car, concrete details are only emerging slowly ahead of the official reveal on January 21, but we can say that range and recharging speeds were a priority during the design process. “With […]