Small Language Models: Architecture, Evolution, and the Future of Artificial Intelligence
Large language models (LLMs) have significantly advanced artificial intelligence, yet their high com-putational, energy, and privacy costs pose substantial challenges. In contrast, Small Language Models(SLMs), typically with fewer than 15 billion parameters, have emerged as efficient alternatives. Thissurvey provides a comprehensive analysis of the SLM landscape, tracing their evolution and examiningarchitectural innovations that enhance efficiency. A novel multi-axis taxonomy is introduced to classifySLMs by genesis, architecture, and optimization goals, offering a structured framework for this field.Performance benchmarks are […]