zomia.news

15dec2023

Microsolf Phi-2 Language Model - it's an old and new way of doing language models. Instead of using a lot of random data, we can use a small subset of high quality data and use that to train the model. Just like in the "good old days" of classical machine learning.

As a result, we have a small and effective model that can be used on mobile devices.

The idea is that in the near future, this will be the main method for the creation of LMs:

It's easy to control the results
The result much more effective
Much more robust