
Small language models

Published: Oct 23, 2024
Trial

Large language models (LLMs) have proven useful in many application areas, but their sheer size can be a source of problems: responding to a prompt requires a lot of compute, making queries slow and expensive; the models are proprietary and so large that they must be hosted in a cloud by a third party, which can be problematic for sensitive data; and training a model is prohibitively expensive in most cases. The last issue can be addressed with the RAG pattern, which side-steps the need to train and fine-tune foundational models, but cost and privacy concerns often remain. In response, we're now seeing growing interest in small language models (SLMs). In comparison to their more popular siblings, they have fewer weights and less precision, usually between 3.5 billion and 10 billion parameters. Recent research suggests that, in the right context and when set up correctly, SLMs can perform as well as or even outperform LLMs. And their size makes it possible to run them on edge devices. We've previously mentioned Google's Gemini Nano, but the landscape is evolving quickly, with Microsoft introducing its Phi-3 series, for example.
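
Because many SLMs ship as open weights, they can be downloaded and run entirely on local hardware rather than queried through a third-party cloud API. The sketch below, which assumes the Hugging Face transformers library and uses Phi-3-mini purely as an illustrative model choice, shows roughly what such a local setup might look like; the exact model, precision and device settings would depend on the hardware available.

```python
# A minimal sketch of running a small language model locally with Hugging Face
# transformers. The model id is an illustrative assumption; any small
# open-weights model could be substituted.
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

model_id = "microsoft/Phi-3-mini-4k-instruct"  # assumed example of an SLM

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",    # place weights on GPU/CPU as available
    torch_dtype="auto",   # use the model's native (reduced) precision
)

generate = pipeline("text-generation", model=model, tokenizer=tokenizer)

prompt = "Summarise the trade-offs of small language models in two sentences."
output = generate(prompt, max_new_tokens=128)
print(output[0]["generated_text"])
```

Since the model runs in-process, no prompt data leaves the machine, which is one of the privacy advantages the paragraph above alludes to.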
