Смирнов Феликс Александрович

Московский институт электроники и математики им. А.Н. Тихонова

Профиль на hse.ru ↗ тел.: 15016

Публикаций

Языков

Наград

Конференций

Профиль Публикации (2) Курсы (0)

Профессиональные интересы

системное программированиевысокопроизводительные вычислениявычислительная линейная алгебрасуперкомпьютерные технологии

Должности

Стажер-исследователь — Московский институт электроники и математики им. А.Н. Тихонова, Международная лаборатория суперкомпьютерного атомистического моделирования и многомасштабного анализа
Аспирант — Московский институт электроники и математики им. А.Н. Тихонова, Международная лаборатория суперкомпьютерного атомистического моделирования и многомасштабного анализа

Био

· Начал работать в НИУ ВШЭ в 2024 году.

Образование

2025 · Магистратура: Национальный исследовательский университет "Высшая школа экономики", специальность «Прикладная математика и информатика», квалификация «Магистр»
2023 · Бакалавриат: Московский технический университет связи и информатики, специальность «Фундаментальная информатика и информационные технологии», квалификация «Бакалавр»

Опыт работы

· Практикант-бакалавр группы по разработке программного обеспечения,
· АО «Интел А/О»
· Стажер-исследователь,
· Объединенный институт высоких температур РАН
· Стажер-исследователь,
· Национальный исследовательский университет «Высшая школа экономики»

Гранты и проекты

— · Участие в гранте РНФ (проект № 20-71-10127, «Анализ и оптимизация энергоэффективности и производительности суперкомпьютеров на перспективной элементной базе отечественного производства», организация ОИВТ РАН)

Конференции (3)

Показать все

· 2025: XXIX Межвузовская научно-техническая конференция студентов, аспирантов и молодых специалистов им. Е.В. Арменского (Москва). Доклад: Подходы к оптимизации производительности параллельных кодов с GPU-aware MPI
· 2024: X Международная конференция "Суперкомпьютерные дни в России" (Москва). Доклад: Анализ применения технологии GPU-aware MPI для сети Ангара: первые результаты
· 2024: 15th International Conference on Parallel Processing & Applied Mathematics (Острава). Доклад: Tracing of GPU-aware MPI Applications: the First Benchmarks for the Angara Interconnect

Идентификаторы исследователя

ORCID: 0009-0002-2478-2484
ResearcherID: KHW-6709-2024

Публикации (2)

Towards performance analysis of GPU-aware MPI over Angara interconnect

2026 · ARTICLE · en

One of the most important aspects of supercomputer development in the post-Moore era is the interconnect technologies that allow one to unite a multitude of processing elements into a well-synchronized computing system. Novel types of supercomputer interconnect require careful benchmarking and compliance with the requirements of modern hardware trends. GPU-based heterogeneous computing is one of the most important current avenues for building high performance computing systems, and the support of GPU-aware MPI technology is a requirement for any competitive interconnect. In this paper, we describe a UCX API based GPU-aware MPI implementation for the Angara interconnect. Performance analysis for peer-to-peer, MPI_Bcast and MPI_Reduce operations is presented, as well as for the rocHPL benchmark and for a typical biomolecular model within the LAMMPS molecular dynamics code. The deployment of the Desmos supercomputer equipped with both Angara and InfiniBand FDR allows us to make an accurate comparison of these two types of interconnect using the latter as a reference.

DOI ↗

Tracing of GPU-Aware MPI Applications: First Benchmarks for the Angara Interconnect

2025 · CHAPTER · en

The efficiency of data transfer is one of the most important issues of supercomputer development in the post-Moore era. The rise of heterogeneous computing systems introduces such complicated patterns of data transfers as, for instance, the GPU-aware MPI technology. The practical deployment of this technology in applications requires the development of the dedicated system software as well as the analysis tools for tracing the runtime behavior of the corresponding applied algorithms. In this work we present the UCX API for GPU-aware MPI implementation over Angara interconnect and analyze the execution patterns of the rocHPL benchmark using the Score-P infrastructure. This analysis allows us to make a comparison of the GPU-aware MPI implementation for the Angara interconnect with the InfiniBand implementation.

DOI ↗

Курсы (0)

Нет курсов.