Ilya Gusev

Machine Learning Engineer
Amsterdam, Netherlands E-mail GitHub LinkedIn Telegram channel Telegram DM Scholar

Demos

Telegram bot with access to different LLMs via OpenRouter and Runpod.
Amateur-level C++ chess engine with web GUI on top of lc0 board representation.
Meme generation on top of Nano Banana Pro.
Semantic search on top of Russian NLP Telegram chats and channels.

Interests

Natural Language Processing Text Summarization Language Models Multi-Agent Systems Chatbots

Experience

Senior Machine Learning Engineer Aug 2022 — Present
Booking.com, Amsterdam
Worked on machine translation, semantic search for hotels, and LLM-based chatbots. Mainly in Python and Java.
Senior Machine Learning Engineer Mar 2018 — Jul 2022
Yandex, Moscow
Worked on many components of Yandex News, including news clustering, summarization, ranking, and recommendations. Python/MapReduce/SQL for prototyping and analytics, C++ for production-ready solutions.
University Teacher Sep 2016 — Jun 2021
Moscow Institute of Physics and Technology
Taught Algorithms to 1st and 2nd-year undergraduates.
Software Engineer Mar 2016 — Mar 2018
ABBYY, Moscow
Worked in the LingvoLive backend team, improving the search of cards with translations of words. Then worked in the linguistics department on tools for machine learning.

Selected Projects

Multi-agent system for writing scientific papers with language models.
⭐ 46
CodeAct-based agentic framework for autonomous task solving.
⭐ 32
MCP server with tools for scientific research.
⭐ 27
Saiga 2024
Open datasets and language models for the Russian language. Complete training pipeline for instruction-tuned models.
❤️ 500+ · ⬇️ 2M
Benchmark for evaluating role-playing language models with user emulation.
⭐ 112
Models for abstractive and extractive summarization of Russian texts.
⭐ 174
RuPo 2017
Russian poetry analyzer and generator using neural networks.
⭐ 177
One of the first contextual morphological analyzers for the Russian language.
⭐ 156

Competitions

Chai Competition Sep 2023 — Jan 2024
Fine-tuned LLMs for role-play conversations.
💰 $5k
Telegram Channel Classification Contest Jul 2021
Built a system for identifying language and category of Telegram channels. 1st place in Round 1 and 2nd place in Round 2.
💰 $19k
Telegram News Aggregation Contest Feb 2020
Built a news aggregation system in C++. 3rd place in Round 1 and 1st place in Round 2.
💰 $31k

Selected Publications

HotelMatch-LLM: Joint Multi-Task Training of Small and Large Language Models for Efficient Multimodal Hotel Retrieval
Arian Askari, Emmanouil Stergiadis, Ilya Gusev , Moran Beladev
ACL 2025 · arxiv:2506.07296
Speed Without Sacrifice: Fine-Tuning Language Models with Medusa and Knowledge Distillation in Travel Applications
Daniel Zagyva, Emmanouil Stergiadis, Laurens Van Der Maas, Aleksandra Dokic, Eran Fainman, Ilya Gusev , Moran Beladev
PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation
Ilya Gusev
Do not lose the message while paraphrasing: A study on content preserving style transfer
Nikolay Babakov, David Dale, Ilya Gusev , Irina Krotova, Alexander Panchenko
NLDB 2023 · arxiv:2308.09055
HeadlineCause: A dataset of news headlines for detecting causalities
Ilya Gusev , Alexey Tikhonov
LREC 2021 · arxiv:2108.12626
Dataset for Automatic Summarization of Russian News
Ilya Gusev
AINL 2020 · arxiv:2006.11063
Improving part-of-speech tagging via multi-task learning and character-level word representations
Daniil Anastasyev, Ilya Gusev , Evgenii Indenbom
Dialogue 2018 · arxiv:1807.00818

Education

Master in Computer Science 2018
Moscow Institute of Physics and Technology
Bachelor in Math and Physics 2016
Moscow Institute of Physics and Technology

Website QR Code

QR Code for ilyagusev.dev
Scan to visit ilyagusev.dev