I'm Thomas van Dongen. I am currently working as a lead machine learning engineer at Springer Nature. I am one of the founding members of The Minish Lab where we develop open-source machine learning packages.
My research interests include:
- 🚤 Small, fast models: Making CPU-friendly models.
- 🔍 Recommenders: Developing smarter systems to improve recommendations and information retrieval, focussed on the scientific publishing space.
- 🧩 Embeddings: Focusing on static embeddings to balance performance and resource usage.
I'm currently working on:
- model2vec: a library for creating state-of-the-art static embeddings by distilling sentence transformers.
- tokenlearn: a library for pre-training static embeddings.
- vicinity: a library for fast and lightweight nearest neighbors, with flexible indexing backends.
Info: