If you’ve used Copilot to answer complex questions, you’ve experienced the power of large language models (LLMs). These models are so large that they can require significant computing resources to run, which is why the rise of small language models (SLMs) is such a big deal.
Parameters are the adjustable variables that determine a model’s behavior. SLMs are still sizable, with several billion parameters, compared with hundreds of billions in LLMs, but they’re small enough to run offline on a phone.
“Small language models can make AI more accessible due to their size and affordability,” says Sebastien Bubeck, who leads the Machine Learning Foundations group at Microsoft Research. “At the same time, we’re discovering new ways to make them as powerful as large language models.”
Microsoft researchers have developed and released two SLMs, Phi and Orca, that perform as well as or better than large language models in certain areas, challenging the notion that scale is required for performance.
Unlike LLMs, which are trained on vast amounts of internet data, these smaller models are trained on curated, high-quality data, and researchers are identifying new thresholds at which smaller size still delivers strong performance. This year, you can expect to see improved models designed to foster more research and innovation.
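To get a feel for what "small" means in practice, here is a minimal sketch of loading one of the released Phi models on your own machine, assuming the Hugging Face transformers library and the publicly available "microsoft/phi-2" checkpoint; the model name, prompt, and generation settings are illustrative choices, not an official Microsoft example.

```python
# Minimal sketch: load a small language model locally and count its parameters.
# Assumes the Hugging Face transformers library and the "microsoft/phi-2"
# checkpoint (a few billion parameters); details here are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "microsoft/phi-2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Count parameters to see where the model sits on the SLM/LLM spectrum.
num_params = sum(p.numel() for p in model.parameters())
print(f"Parameters: {num_params / 1e9:.1f}B")

# Generate a short completion entirely on the local device.
prompt = "Explain why small language models matter:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

On a laptop or phone-class chip, a model of this size can run without a connection to a data center, which is the accessibility point Bubeck makes above.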