zomia.news

15dec2023

Microsoft Phi-2 Language Model - an approach to building language models that is both old and new. Instead of training on a huge amount of random data, we can train the model on a small subset of high-quality data. Just like in the "good old days" of classical machine learning.
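
To make the "small subset of high-quality data" idea concrete, here is a minimal sketch in Python. It is not how Phi-2's training data was actually selected. The `quality_score` heuristic, the `select_high_quality` helper and the `keep_fraction` parameter are all hypothetical names made up for illustration; a real pipeline would probably use a trained quality classifier or curated sources instead of a word-counting rule.

```python
def quality_score(text: str) -> float:
    """Hypothetical heuristic: share of purely alphabetic words in the text.

    Texts shorter than five words score 0.0. This is an illustrative
    stand-in for a real quality classifier, not an actual Phi-2 component.
    """
    words = text.split()
    if len(words) < 5:
        return 0.0
    # Strip common punctuation so "mat." still counts as a word.
    alpha = sum(w.strip(".,;:!?\"'()").isalpha() for w in words)
    return alpha / len(words)


def select_high_quality(corpus, keep_fraction=0.1):
    """Keep only the top `keep_fraction` of documents, ranked by quality_score."""
    ranked = sorted(corpus, key=quality_score, reverse=True)
    keep = max(1, int(len(ranked) * keep_fraction))
    return ranked[:keep]
```

Calling `select_high_quality(corpus, keep_fraction=0.5)` on a list of strings would return the better-scoring half, and only that subset would then be passed on to training.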

As a result, we have a small and effective model that can be used on mobile devices.

The idea is that in the near future, this will become the main way of building LMs:

  • It's easy to control the results
  • The result is much more effective
  • The result is much more robust