15dec2023
Microsoft Phi-2 language model: it's both an old and a new way of building language models. Instead of training on a huge amount of random data, we can pick a small subset of high-quality data and train the model on that. Just like in the "good old days" of classical machine learning.
As a result, we get a small but effective model that can be used on mobile devices.
The idea is that in the near future this will become the main way to build LMs:
- It's easy to control the results
- The result is much more effective
- Much more robust
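
To make the "small subset of high-quality data" idea concrete, here is a minimal sketch of the selection step. Everything in it is an assumption for illustration: the `Document` class, the `quality` score, and `select_high_quality` are hypothetical names, and this note does not describe how Phi-2's training data was actually chosen. In practice the score might come from a classifier or from human labels.

```python
from dataclasses import dataclass


@dataclass
class Document:
    text: str
    quality: float  # hypothetical score in [0, 1]; higher means better


def select_high_quality(docs, keep_fraction=0.1):
    """Return the top `keep_fraction` of documents, best first."""
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")
    if not docs:
        return []
    # Rank by the (assumed) quality score, highest first.
    ranked = sorted(docs, key=lambda d: d.quality, reverse=True)
    # Always keep at least one document from a non-empty corpus.
    n_keep = max(1, round(len(ranked) * keep_fraction))
    return ranked[:n_keep]


# Toy corpus with made-up scores, only to show the call.
corpus = [
    Document("A step-by-step explanation of binary search.", 0.92),
    Document("click here!!! free prizes", 0.05),
    Document("Worked example: solving a linear equation.", 0.88),
    Document("asdf asdf asdf", 0.01),
]
subset = select_high_quality(corpus, keep_fraction=0.5)
```

The training set is then just `subset`, which is also why the results are easier to control: the only knobs are the scoring rule and `keep_fraction`.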