digitado

Cross-Dataset Bloom Question Classification: Supervised Models and Prompted LLMs

digitado ⋅ 15 June 2026

arXiv:2606.13684v1 Announce Type: new Abstract: Automatic Bloom’s taxonomy classification of assessment questions can substantially reduce instructor workload, but labeling is subjective and teacher-dependent. Prior machine learning (ML) and deep learning (DL) approaches reported strong within-dataset results, yet were rarely evaluated in cross-dataset settings, leaving real-world generalizability unclear; meanwhile, LLM effectiveness for Bloom question classification has not been systematically studied. We evaluated the cross-dataset generalization of existing ML/DL methods and assessed LLMs with multiple prompting strategies on five datasets; […]

technocracy

Building Production-Ready RAG Systems with Free LLMs: From Zero to Analysis-Ready in 6 Steps

digitado ⋅ 17 February 2026

Introduction: When I started exploring Retrieval-Augmented Generation (RAG) systems for incident analysis, I realized that jumping straight into paid APIs like Claude or OpenAI wasn’t practical for learning and experimentation. Instead, I wanted to build something completely local, free to run, and powerful enough to handle real production scenarios. This article documents my journey building a fully functional RAG system that analyzes production incidents by learning from past issues, without spending a dime on API calls. Everything runs on […]

technocracy

SaaSpocalypse Has Already Begun

digitado ⋅ 10 February 2026

Anthropic published a plugin for legal work on 30 January, and within four to five days software stocks had lost billions of dollars in value. The pace of the collapse shocked everyone, because the whole industry had been operating on a fundamental misperception of how the AI economy works. Why Everyone Thought They Were Safe: The narrative that dominated the technology sector held that AI companies would build baseline models, which would then serve as a foundation for software companies to […]

technocracy

The perfect scam: when your phone is the door and AI is the master key

digitado ⋅ 11 March 2026

My column this week in Invertia is titled "It's 2026: they are not going to 'hack' you, they are going to 'convince' you" (pdf), and it deals with something we keep refusing to accept because it forces us to acknowledge an uncomfortable truth: most financial fraud is a problem of psychology, not of technology. Scammers do not "get into" your bank by breaking systems; they get into your head by manufacturing urgency, fear, and shortcuts. And they keep getting better at it because the […]

technocracy

The $12.6 Million “Patient Zero”: Healthcare’s Identity Crisis

digitado ⋅ 15 April 2026

In the high-stakes bazaar of the dark web, there is a clear hierarchy of value. A stolen credit card is a mere commodity, often trading for the price of a cup of coffee. A Social Security number is a versatile tool. But a comprehensive medical record? That is a masterpiece. It contains a permanent, unchangeable blueprint of a human life — history, genetic markers, and biometric vulnerabilities. Unlike a compromised bank account, which can be frozen and reset, […]

technocracy

EVNextTrade: Learning-to-Rank-Based Recommendation of Next Charging Nodes for EV-EV Energy Trading

digitado ⋅ 31 March 2026

arXiv:2603.26688v1 Announce Type: new Abstract: Peer-to-peer energy trading among electric vehicles (EVs) has been increasingly studied as a promising solution for improving supply-side resilience under growing charging demand and constrained charging infrastructure. While prior studies on EV-EV energy trading and related EV research have largely focused on transaction management or isolated mobility prediction tasks, the problem of identifying which charging nodes are more suitable for EV-EV trading in journey contexts remains open. We address this gap by formulating […]

technocracy

Conformal Blindness: A Note on $A$-Cryptic change-points

digitado ⋅ 23 January 2026

arXiv:2601.01147v2 Announce Type: replace Abstract: Conformal Test Martingales (CTMs) are a standard method within the Conformal Prediction framework for testing the crucial assumption of data exchangeability by monitoring deviations from uniformity in the p-value sequence. Although exchangeability implies uniform p-values, the converse does not hold. This raises the question of whether a significant break in exchangeability can occur, such that the p-values remain uniform, rendering CTMs blind. We answer this affirmatively, demonstrating the phenomenon of conformal blindness. Through […]

technocracy

Rethinking Reinforcement fine-tuning of LLMs: A Multi-armed Bandit Learning Perspective

digitado ⋅ 21 January 2026

A large number of heuristics have been proposed to optimize the reinforcement fine-tuning of LLMs. However, inconsistent claims are made from time to time, making this area elusive. Reflecting on this situation, two fundamental questions still lack a clear understanding: 1) what is the role of each optimizing choice? 2) which ones are the bottlenecks? This paper aims to shed light on them, and it faces the challenge of several entangled confounding factors in the fine-tuning process. To […]

technocracy

Instruction-Tuned, but Not More Verifiable Instruction-Following: A Cross-Task Diagnosis for LoRA Adapters

digitado ⋅ 25 March 2026

arXiv:2603.22379v1 Announce Type: new Abstract: Adapters are often selected and deployed based on nominal labels (e.g., instruction-tuned), which implicitly suggest what capability improves after adaptation. We test whether nominal training objectives reliably align with realized cross-task capability gains by evaluating the same LoRA adapter across tasks. Our strongest evidence is tied to strict, automatically verifiable instruction following as measured by IFEval: across multiple seeds, base models, and LoRA settings, nominal labels recurrently but not universally fail to predict […]

technocracy

Early land animals skipped the tadpole phase

digitado ⋅ 23 June 2026

For decades, biologists thought that early tetrapods, ancient vertebrates that started conquering the land over 300 million years ago, developed like modern amphibians—beginning their lives as purely aquatic tadpoles and then metamorphosing into terrestrial adults. “A lot of that comes from this old ‘scala naturae’ idea that you had fish that evolved into the next stage up, which were amphibians, and then amphibians evolved into the next stage up, which were reptiles that evolved into birds and mammals,” […]
