digitado

Formally proving a calculation with Claude and Lean

digitado ⋅ 11 de June de 2026

I ran an experiment today to see whether Claude [1] could generate Lean code to prove a calculation at the bottom of this post, six lines of calculus. I started with this prompt This page contains a mathematical proof that a Fourier coefficient, a_n, is given in terms of a Bessel function. The LaTeX source for the SVG image is contained in the alt tag of the image. Generate a formal proof of the result using Lean. and […]

Ver mais

Like 0

Liked Liked

technocracy

Huntington Bank: Redacting sensitive data from 400M+ documents with AWS

digitado ⋅ 24 de June de 2026

When your document repository contains hundreds of millions of files accumulated over nearly a decade, how do you systematically find and redact sensitive customer data without taking years to complete? This was the challenge facing The Huntington National Bank (Huntington), a top 10 bank in the United States. Redacting sensitive information at scale Since 2015, Huntington’s document management system has securely stored hundreds of millions of documents on-premises. In 2025, as part of a proactive compliance initiative, Huntington […]

Ver mais

Like 0

Liked Liked

technocracy

A graph neural network based chemical mechanism reduction method for combustion applications

digitado ⋅ 25 de March de 2026

arXiv:2603.22318v1 Announce Type: new Abstract: Direct numerical simulations of turbulent reacting flows involving millions of grid points and detailed chemical mechanisms with hundreds of species and thousands of reactions are computationally prohibitive. To address this challenge, we present two data-driven chemical mechanism reduction formulations based on graph neural networks (GNNs) with message-passing transformer layers that learn nonlinear dependencies among species and reactions. The first formulation, GNN-SM, employs a pre-trained surrogate model to guide reduction across a broad range […]

Ver mais

Like 0

Liked Liked

technocracy

CDP vs MDM: Similar Goals, Different Jobs

digitado ⋅ 3 de April de 2026

In conversations about customer data, one question comes up again and again: if both CDPs and MDMs help create a more complete view of the customer, are they basically doing the same thing? It is an understandable question. After all, both technologies are often positioned around customer unification, identity resolution, and creating better visibility across systems. On the surface, they can sound very similar. But while CDPs and MDMs do overlap in some areas, they are not the […]

Ver mais

Like 0

Liked Liked

technocracy

VJEPA: Variational Joint Embedding Predictive Architectures as Probabilistic World Models

digitado ⋅ 22 de January de 2026

arXiv:2601.14354v1 Announce Type: new Abstract: Joint Embedding Predictive Architectures (JEPA) offer a scalable paradigm for self-supervised learning by predicting latent representations rather than reconstructing high-entropy observations. However, existing formulations rely on textit{deterministic} regression objectives, which mask probabilistic semantics and limit its applicability in stochastic control. In this work, we introduce emph{Variational JEPA (VJEPA)}, a textit{probabilistic} generalization that learns a predictive distribution over future latent states via a variational objective. We show that VJEPA unifies representation learning with Predictive […]

Ver mais

Like 0

Liked Liked

technocracy

AI, Metacognition, and the Verification Bottleneck: A Three-Wave Longitudinal Study of Human Problem-Solving

digitado ⋅ 27 de January de 2026

arXiv:2601.17055v1 Announce Type: new Abstract: This longitudinal pilot study tracked how generative AI reshapes problem-solving over six months across three waves in an academic setting. AI integration reached saturation by Wave 3, with daily use rising from 52.4% to 95.7% and ChatGPT adoption from 85.7% to 100%. A dominant hybrid workflow increased 2.7-fold, adopted by 39.1% of participants. The verification paradox emerged: participants relied most heavily on AI for difficult tasks (73.9%) yet showed declining verification confidence (68.1%) […]

Ver mais

Like 0

Liked Liked

technocracy

Full-Stack Data Scientists for the Agentic Coding World

digitado ⋅ 29 de May de 2026

Author(s): Michael Shapiro MD MSc Originally published on Towards AI. The Next Evolution of Data Teams For years, building data products required a chain of specialists: data engineers, data scientists, software engineers, ML engineers, MLOps teams, and product managers. This specialization enabled organizations to tackle increasingly complex problems, but it also introduced handoffs, dependencies, and slower feedback cycles. (If you’re not a Medium Member, read it for free here). After the introduction, the article explains why agentic coding […]

Ver mais

Like 0

Liked Liked

technocracy

OpenClaw security fears lead Meta, other AI firms to restrict its use

digitado ⋅ 19 de February de 2026

Last month, Jason Grad issued a late-night warning to the 20 employees at his tech startup. “You’ve likely seen Clawdbot trending on X/LinkedIn. While cool, it is currently unvetted and high-risk for our environment,” he wrote in a Slack message with a red siren emoji. “Please keep Clawdbot off all company hardware and away from work-linked accounts.” Grad isn’t the only tech executive who has raised concerns to staff about the experimental agentic AI tool, which was briefly […]

Ver mais

Like 0

Liked Liked

technocracy

Gaussian Process Bandit Optimization with Machine Learning Predictions and Application to Hypothesis Generation

digitado ⋅ 2 de February de 2026

arXiv:2601.22315v1 Announce Type: new Abstract: Many real-world optimization problems involve an expensive ground-truth oracle (e.g., human evaluation, physical experiments) and a cheap, low-fidelity prediction oracle (e.g., machine learning models, simulations). Meanwhile, abundant offline data (e.g., past experiments and predictions) are often available and can be used to pretrain powerful predictive models, as well as to provide an informative prior. We propose Prediction-Augmented Gaussian Process Upper Confidence Bound (PA-GP-UCB), a novel Bayesian optimization algorithm that leverages both oracles and […]

Ver mais

Like 0

Liked Liked

technocracy

The End of Token Inflation with DeepSeek OCR-2

digitado ⋅ 2 de February de 2026

Author(s): Mandar Karhade, MD. PhD. Originally published on Towards AI. How “Context Optical Compression” Re-Engineers Document Processing from First Principles The tech world buzzes with excitement every time a leaderboard changes hands, usually celebrating a massive model with a parameter count that rivals the number of stars in the galaxy. But sometimes, the most disruptive shifts aren’t about getting bigger — they’re about getting smarter with the resources we already have. DeepSeek OCR 2 BenchmarksDeepSeek OCR 2 introduces […]

Ver mais

Like 0

Liked Liked