Ветров Дмитрий Петрович

Факультет компьютерных наук

Профиль на hse.ru ↗ тел.: +7 (495) 772-95-90 | 27252

Публикаций

Языков

Наград

Конференций

Профиль Публикации (86) Курсы (2)

Должности

Научный руководитель — Факультет компьютерных наук, Институт искусственного интеллекта и цифровых наук
Профессор-исследователь — Факультет компьютерных наук, Департамент больших данных и информационного поиска

Био

· Начал работать в НИУ ВШЭ в 2014 году.
· Научно-педагогический стаж: 15 лет.

Образование

2007 · Кандидат физико-математических наук: Московский государственный университет им. М.В. Ломоносова, специальность 01.01.09 «Дискретная математика и математическая кибернетика», тема диссертации: Влияние устойчивости алгоритмов классификации на точность их работы
2003 · Специалитет: Московский государственный университет им. М.В. Ломоносова, специальность «Прикладная математика и информатика», квалификация «Математик. Системный программист»

Опыт работы

· 2017-н.в.: : руководитель центра глубинного обучения и байесовских методов (НИУ ВШЭ, факультет компьютерных наук)
· 2018-2020: : руководитель лаборатории компании Самсунг (НИУ ВШЭ, факультет компьютерных наук)
· 2016-н.в.: : профессор-исследователь (НИУ ВШЭ, факультет компьютерных наук)
· 2016-н.в.: : профессор-ислледователь (НИУ ВШЭ, факультет компьютерных наук)
· 2016-2018: : Яндекс, ведущий исследователь (полставки)
· 2015-2016: : Сколтех, доцент
· 2014-2016: : НИУ ВШЭ, факультет компьютерных наук, доцент (неполная ставка)
· 2014-2015: : МГУ, факультет вычислительной математики и кибернетики, доцент
· 2011-2014: : МГУ, факультет вычислительной математики и кибернетики, ассистент
· 2010-2012: : Курчатовский институт, НБИК-центр, зав. лабораторией (полставки)
· 2007-2011: : МГУ, факультет вычислительной математики и кибернетики, научный сотрудник
· 2005: Лето
· 2006: : Валлийский университет, Бангор, стажер
· 2000-2007: : Вычислительный центр им. А.А. Дородницына РАН, математик (полставки)

Награды и поощрения

· Благодарность НИУ ВШЭ (март 2024)
· Благодарственное письмо первого проректора НИУ ВШЭ (февраль 2023)
· Почетное звание "Почетный работник сферы образования Российской Федерации" (ноябрь 2022)
· Почетная грамота НИУ ВШЭ (февраль 2022)
· Почетная грамота НИУ ВШЭ (декабрь 2015)
· Золотая медаль Российского отделения Европейской академии за цикл научных работ по байесовской регуляризации и выводу в графических моделях (декабрь 2012)
· Стипендия Президента РФ для ведущих молодых ученых (июнь 2012)
· Надбавка за публикации, вносящие особый вклад в международную научную репутацию НИУ ВШЭ (2022–2025, 2021–2024)
· Надбавка за публикацию в международном рецензируемом научном издании (2019–2021, 2017–2019)
· Надбавка за статью в зарубежном рецензируемом журнале (2015–2017)
· Лучший преподаватель — 2019–2020, 2019

Гранты и проекты

— · на соискание учёной степени кандидата наук

Конференции (1)

Показать все

· 2016: Advances in Neural Information Processing Systems 2016 (Барселона). Доклад: PerforatedCNNs: Acceleration through Elimination of Redundant Convolutions

Идентификаторы исследователя

ORCID: 0000-0001-6863-9028
ResearcherID: H-4870-2015
SPIN РИНЦ: 4339-7570
Google Scholar: https://scholar.google.ru/citations?user=7HU0UoUAAAAJ&hl=ru
Scopus AuthorID: 8382687000

Публикации (86)

TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings

2025 · CHAPTER · en

This paper presents the Text Encoding Diffusion Model (TEncDM), a novel approach to diffusion modeling that operates in the space of pre-trained language model encodings. In contrast to traditionally used embeddings, encodings integrate contextual information. In our approach, we also employ a transformer-based decoder, specifically designed to incorporate context in the token prediction process. We conduct a comprehensive examination of the influence of the encoder, decoder, noise scheduler, and self-conditioning on zero-shot generation. Furthermore, we compare TEncDM with previous approaches on three conditional text generation tasks: QQP, XSum, and Wiki-Auto. The results show that TEncDM exhibits superior performance compared to existing non-autoregressive diffusion models.

DOI ↗ PDF ↗

HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach

2024 · CHAPTER · en

PDF ↗

Regularized Distribution Matching Distillation for One-step Unpaired Image-to-Image Translation

2024 · CHAPTER · en

Diffusion distillation methods aim to compress the diffusion models into efficient one-step generators while trying to preserve quality. Among them, Distribution Matching Distillation (DMD) offers a suitable framework for training general-form onestep generators, applicable beyond unconditional generation. In this work, we introduce its modif ication, called Regularized Distribution Matching Distillation, applicable to unpaired image-toimage problems. We demonstrate its empirical performance in application to several translation tasks, including 2D examples and I2I between different image datasets, where it performs on par or better than multi-step diffusion baselines.

DOI ↗ PDF ↗

Where Do Large Learning Rates Lead Us?

2024 · CHAPTER · en

DOI ↗

HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach

2024 · CHAPTER · en

Our paper addresses the complex task of transferring a hairstyle from a reference image to an input photo for virtual hair try-on. This task is challenging due to the need to adapt to various photo poses, the sensitivity of hairstyles, and the lack of objective metrics. The current state of the art hairstyle transfer methods use an optimization process for different parts of the approach, making them inexcusably slow. At the same time, faster encoder-based models are of very low quality because they either operate in StyleGAN’s W+ space or use other low-dimensional image generators. Additionally, both approaches have a problem with hairstyle transfer when the source pose is very different from the target pose, because they either don’t consider the pose at all or deal with it inefficiently. In our paper, we present the HairFast model, which uniquely solves these problems and achieves high resolution, near real-time performance, and superior reconstruction compared to optimization problem-based methods. Our solution includes a new architecture operating in the FS latent space of StyleGAN, an enhanced inpainting approach, and improved encoders for better alignment, color transfer, and a new encoder for post-processing. The effectiveness of our approach is demonstrated on realism metrics after random hairstyle transfer and reconstruction when the original hairstyle is transferred. In the most difficult scenario of transferring both shape and color of a hairstyle from different images, our method performs in less than a second on the Nvidia V100.

DOI ↗ PDF ↗

Differentiable Rendering with Reparameterized Volume Sampling

2024 · CHAPTER · en

PDF ↗

Generative Flow Networks as Entropy-Regularized RL

2024 · CHAPTER · en

The recently proposed generative flow networks (GFlowNets) are a method of training a policy to sample compositional discrete objects with probabilities proportional to a given reward via a sequence of actions. GFlowNets exploit the sequential nature of the problem, drawing parallels with reinforcement learning (RL). Our work extends the connection between RL and GFlowNets to a general case. We demonstrate how the task of learning a generative flow network can be efficiently redefined as an entropy-regularized RL problem with a specific reward and regularizer structure. Furthermore, we illustrate the practical efficiency of this reformulation by applying standard soft RL algorithms to GFlowNet training across several probabilistic modeling tasks. Contrary to previously reported results, we show that entropic RL approaches can be competitive against established GFlowNet training methods. This perspective opens a direct path for integrating RL principles into the realm of generative flow networks.

PDF ↗

The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing

2024 · CHAPTER · en

The task of manipulating real image attributes through StyleGAN inversion has been extensively researched. This process involves searching latent variables from a well-trained StyleGAN generator that can synthesize a real image modifying these latent variables and then synthesizing an image with the desired edits. A balance must be struck between the quality of the reconstruction and the ability to edit. Earlier studies utilized the low-dimensional W-space for latent search which facilitated effective editing but struggled with reconstructing intricate details. More recent research has turned to the high-dimensional feature space F which successfully inverses the input image but loses much of the detail during editing. In this paper we introduce StyleFeatureEditor -- a novel method that enables editing in both w-latents and F-latents. This technique not only allows for the reconstruction of finer image details but also ensures their preservation during editing. We also present a new training pipeline specifically designed to train our model to accurately edit F-latents. Our method is compared with state-of-the-art encoding approaches demonstrating that our model excels in terms of reconstruction quality and is capable of editing even challenging out-of-domain examples.

Improving GFlowNets with Monte Carlo Tree Search

2024 · CHAPTER · en

MARS: Masked Automatic Ranks Selection in Tensor Decompositions

2023 · CHAPTER · en

Tensor decomposition methods have proven effective in various applications, including compression and acceleration of neural networks. At the same time, the problem of determining optimal decomposition ranks, which present the crucial parameter controlling the compressionaccuracy trade-off, is still acute. In this paper, we introduce MARS - a new efficient method for the automatic selection of ranks in general tensor decompositions. During training, the procedure learns binary masks over decomposition cores that “select” the optimal tensor structure. The learning is performed via relaxed maximum a posteriori (MAP) estimation in a specific Bayesian model and can be naturally embedded into the standard neural network training routine. Diverse experiments demonstrate that MARS achieves better results compared to previous works in various tasks.

Курсы (2)

Байесовские методы в машинном обучении · 5 раза

2025/2026, 2024/2025, 2023/2024, 2022/2023, 2021/2022 · Бакалавриат / Бакалавриат направление: 38.03.01 Экономика / Дисциплина общефакультетского пула / Магистратура / Маго-лего · рус
Нейробайесовские методы в машинном обучении

2021/2022 · Магистратура · рус