Ветров Дмитрий Петрович

Факультет компьютерных наук

Профиль на hse.ru ↗ тел.: +7 (495) 772-95-90 | 27252

Публикаций

Языков

Наград

Конференций

Профиль Публикации (86) Курсы (2)

Должности

Научный руководитель — Факультет компьютерных наук, Институт искусственного интеллекта и цифровых наук
Профессор-исследователь — Факультет компьютерных наук, Департамент больших данных и информационного поиска

Био

· Начал работать в НИУ ВШЭ в 2014 году.
· Научно-педагогический стаж: 15 лет.

Образование

2007 · Кандидат физико-математических наук: Московский государственный университет им. М.В. Ломоносова, специальность 01.01.09 «Дискретная математика и математическая кибернетика», тема диссертации: Влияние устойчивости алгоритмов классификации на точность их работы
2003 · Специалитет: Московский государственный университет им. М.В. Ломоносова, специальность «Прикладная математика и информатика», квалификация «Математик. Системный программист»

Опыт работы

· 2017-н.в.: : руководитель центра глубинного обучения и байесовских методов (НИУ ВШЭ, факультет компьютерных наук)
· 2018-2020: : руководитель лаборатории компании Самсунг (НИУ ВШЭ, факультет компьютерных наук)
· 2016-н.в.: : профессор-исследователь (НИУ ВШЭ, факультет компьютерных наук)
· 2016-н.в.: : профессор-ислледователь (НИУ ВШЭ, факультет компьютерных наук)
· 2016-2018: : Яндекс, ведущий исследователь (полставки)
· 2015-2016: : Сколтех, доцент
· 2014-2016: : НИУ ВШЭ, факультет компьютерных наук, доцент (неполная ставка)
· 2014-2015: : МГУ, факультет вычислительной математики и кибернетики, доцент
· 2011-2014: : МГУ, факультет вычислительной математики и кибернетики, ассистент
· 2010-2012: : Курчатовский институт, НБИК-центр, зав. лабораторией (полставки)
· 2007-2011: : МГУ, факультет вычислительной математики и кибернетики, научный сотрудник
· 2005: Лето
· 2006: : Валлийский университет, Бангор, стажер
· 2000-2007: : Вычислительный центр им. А.А. Дородницына РАН, математик (полставки)

Награды и поощрения

· Благодарность НИУ ВШЭ (март 2024)
· Благодарственное письмо первого проректора НИУ ВШЭ (февраль 2023)
· Почетное звание "Почетный работник сферы образования Российской Федерации" (ноябрь 2022)
· Почетная грамота НИУ ВШЭ (февраль 2022)
· Почетная грамота НИУ ВШЭ (декабрь 2015)
· Золотая медаль Российского отделения Европейской академии за цикл научных работ по байесовской регуляризации и выводу в графических моделях (декабрь 2012)
· Стипендия Президента РФ для ведущих молодых ученых (июнь 2012)
· Надбавка за публикации, вносящие особый вклад в международную научную репутацию НИУ ВШЭ (2022–2025, 2021–2024)
· Надбавка за публикацию в международном рецензируемом научном издании (2019–2021, 2017–2019)
· Надбавка за статью в зарубежном рецензируемом журнале (2015–2017)
· Лучший преподаватель — 2019–2020, 2019

Гранты и проекты

— · на соискание учёной степени кандидата наук

Конференции (1)

Показать все

· 2016: Advances in Neural Information Processing Systems 2016 (Барселона). Доклад: PerforatedCNNs: Acceleration through Elimination of Redundant Convolutions

Идентификаторы исследователя

ORCID: 0000-0001-6863-9028
ResearcherID: H-4870-2015
SPIN РИНЦ: 4339-7570
Google Scholar: https://scholar.google.ru/citations?user=7HU0UoUAAAAJ&hl=ru
Scopus AuthorID: 8382687000

Публикации (86)

Loss function dynamics and landscape for deep neural networks trained with quadratic loss

2023 · CHAPTER · en

Knowledge of the loss landscape geometry makes it possible to successfully explain the behavior of neural networks, the dynamics of their training, and the relationship between resulting solutions and hyperparameters, such as the regularization method, neural network architecture, or learning rate schedule. In this paper, the dynamics of learning and the surface of the standard cross-entropy loss function and the currently popular mean squared error (MSE) loss function for scale-invariant networks with normalization are studied. Symmetries are eliminated via the transition to optimization on a sphere. As a result, three learning phases with fundamentally different properties are revealed depending on the learning step on the sphere, namely, convergence phase, phase of chaotic equilibrium, and phase of destabilized learning. These phases are observed for both loss functions, but larger networks and longer learning for the transition to the convergence phase are required in the case of MSE loss.

StyleDomain: Efficient and Lightweight Parameterizations of StyleGAN for One-shot and Few-shot Domain Adaptation

2023 · CHAPTER · en

DOI ↗

Differentiable Rendering with Reparameterized Volume Sampling

2023 · CHAPTER · en

In view synthesis, a neural radiance field approximates underlying density and radiance fields based on a sparse set of scene pictures. To generate a pixel of a novel view, it marches a ray through the pixel and computes a weighted sum of radiance emitted from a dense set of ray points. This rendering algorithm is fully differentiable and facilitates gradient-based optimization of the fields. However, in practice, only a tiny opaque portion of the ray contributes most of the radiance to the sum. We propose a simple end-to-end differentiable sampling algorithm based on inverse transform sampling. It generates samples according to the probability distribution induced by the density field and picks non-transparent points on the ray. We utilize the algorithm in two ways. First, we propose a novel rendering approach based on Monte Carlo estimates. This approach allows for evaluating and optimizing a neural radiance field with just a few radiance field calls per ray. Second, we use the sampling algorithm to modify the hierarchical scheme proposed in the original NeRF work. We show that our modification improves reconstruction quality of hierarchical models, at the same time simplifying the training procedure by removing the need for auxiliary proposal network losses.

HIFI++: A Unified Framework for Bandwidth Extension and Speech Enhancement

2023 · CHAPTER · en

Generative adversarial networks have recently demonstrated outstanding performance in neural vocoding outperforming best autoregressive and flow-based models. In this paper, we show that this success can be extended to other tasks of conditional audio generation. In particular, building upon HiFi vocoders, we propose a novel HiFi++ general frame-work for bandwidth extension and speech enhancement. We show that with the improved generator architecture, HiFi++ performs better or comparably with the state-of-the-art in these tasks while spending significantly less computational resources. The effectiveness of our approach is validated through a series of extensive experiments.

DOI ↗

Star-Shaped Denoising Diffusion Probabilistic Models

2023 · CHAPTER · en

PDF ↗

To Stay or Not to Stay in the Pre-train Basin: Insights on Ensembling in Transfer Learning

2023 · CHAPTER · en

Transfer learning and ensembling are two popular techniques for improving the performance and robustness of neural networks. Due to the high cost of pre-training, ensembles of models fine-tuned from a single pre-trained checkpoint are often used in practice. Such models end up in the same basin of the loss landscape, which we call the pre-train basin, and thus have limited diversity. In this work, we show that ensembles trained from a single pre-trained checkpoint may be improved by better exploring the pre-train basin, however, leaving the basin results in losing the benefits of transfer learning and in degradation of the ensemble quality. Based on the analysis of existing exploration methods, we propose a more effective modification of the Snapshot Ensembles (SSE) for transfer learning setup, StarSSE, which results in stronger ensembles and uniform model soups.

DOI ↗

FFC-SE: Fast Fourier Convolution for Speech Enhancement

2022 · CHAPTER · en

Fast Fourier convolution (FFC) is the recently proposed neural operator showing promising performance in several computer vision problems. The FFC operator allows employing large receptive field operations within early layers of the neural network. It was shown to be especially helpful for inpainting of periodic structures which are common in audio processing. In this work, we design neural network architectures which adapt FFC for speech enhancement. We hypothesize that a large receptive field allows these networks to produce more coherent phases than vanilla convolutional models, and validate this hypothesis experimentally. We found that neural networks based on Fast Fourier convolution outperform analogous convolutional models and show better or comparable results with other speech enhancement baselines.

Variational Autoencoders for Precoding Matrices with High Spectral Efficiency

2022 · CHAPTER · en

Neural networks are used for channel decoding, channel detection, channel evaluation, and resource management in multi-input and multi-output (MIMO) wireless communication systems. In this paper, we consider the problem of finding precoding matrices with high spectral efficiency (SE) using variational autoencoder (VAE). We propose a computationally efficient algorithm for sampling precoding matrices with minimal loss of quality compared to the optimal precoding. In addition to VAE, we use the conditional variational autoencoder (CVAE) to build a unified generative model. Both of these methods are able to reconstruct the distribution of precoding matrices of high SE by sampling latent variables. This distribution obtained using VAE and CVAE methods is described in the literature for the first time.

DOI ↗

Training Scale-Invariant Neural Networks on the Sphere Can Happen in Three Regimes

2022 · CHAPTER · en

A fundamental property of deep learning normalization techniques, such as batch normalization, is making the pre-normalization parameters scale invariant. The intrinsic domain of such parameters is the unit sphere, and therefore their gradient optimization dynamics can be represented via spherical optimization with varying effective learning rate (ELR), which was studied previously. However, the varying ELR may obscure certain characteristics of the intrinsic loss landscape structure. In this work, we investigate the properties of training scale-invariant neural networks directly on the sphere using a fixed ELR. We discover three regimes of such training depending on the ELR value: convergence, chaotic equilibrium, and divergence. We study these regimes in detail both on a theoretical examination of a toy example and on a thorough empirical analysis of real scale-invariant deep learning models. Each regime has unique features and reflects specific properties of the intrinsic loss landscape, some of which have strong parallels with previous research on both regular and scale-invariant neural networks training. Finally, we demonstrate how the discovered regimes are reflected in conventional training of normalized networks and how they can be leveraged to achieve better optima.

HyperDomainNet: Universal Domain Adaptation for Generative Adversarial Networks

2022 · CHAPTER · en

Domain adaptation framework of GANs has achieved great progress in recent years as a main successful approach of training contemporary GANs in the case of very limited training data. In this work, we significantly improve this framework by proposing an extremely compact parameter space for fine-tuning the generator. We introduce a novel domain-modulation technique that allows to optimize only 6 thousand-dimensional vector instead of 30 million weights of StyleGAN2 to adapt to a target domain. We apply this parameterization to the state-of-art domain adaptation methods and show that it has almost the same expressiveness as the full parameter space. Additionally, we propose a new regularization loss that considerably enhances the diversity of the fine-tuned generator. Inspired by the reduction in the size of the optimizing parameter space we consider the problem of multi-domain adaptation of GANs, i.e. setting when the same model can adapt to several domains depending on the input query. We propose the HyperDomainNet that is a hypernetwork that predicts our parameterization given the target domain. We empirically confirm that it can successfully learn a number of domains at once and may even generalize to unseen domains. Source code can be found at this https URL

Курсы (2)

Байесовские методы в машинном обучении · 5 раза

2025/2026, 2024/2025, 2023/2024, 2022/2023, 2021/2022 · Бакалавриат / Бакалавриат направление: 38.03.01 Экономика / Дисциплина общефакультетского пула / Магистратура / Маго-лего · рус
Нейробайесовские методы в машинном обучении

2021/2022 · Магистратура · рус