Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

A newer but much better system actually reduces the model size while reducing the functionality of the system - similar to training a NN for a very specific task (as was typical several years ago), but now it can happen with far less data. https://arxiv.org/pdf/2305.02301.pdf This paper is quite fantastic, and will likely shape up to be a quite important glue task for LLM models to generate.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: