HSE Scientists Optimise Training of Generative Flow Networks

Researchers at the HSE Faculty of Computer Science have optimised the training method for generative flow networks to handle unstructured tasks, which could make the search for new drugs more efficient. The results of their work were presented at ICLR 2025, one of the world’s leading conferences on machine learning. The paper is available on arXiv.org.
Generative Flow Networks (GFlowNets) are a class of machine learning algorithms that build complex objects step by step. Researchers use them to search for new proteins and drugs, and to optimise transport systems.
For GFlowNets to discover such complex structures, researchers specify the desired properties of the target object. The closer the network’s proposed solution is to these properties, the higher the reward it receives. GFlowNets aim to solve problems in a way that maximises their reward. They do not rely on data directly, but instead on the reward, which is computed using an equation known as the value function.
The search for a complex object can be compared to assembling a Lego model, where pieces are added step by step until the object is complete, with each model assigned a specific value—for example, a plant model might be valued higher than an animal model. Unlike other machine learning methods that would strive to construct a plant at any cost, GFlowNets generate a variety of objects—but plants more frequently than animals—because the reward for plants is higher.
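The Lego analogy can be made concrete with a small sketch. It assumes two hypothetical finished models with illustrative reward values (3 for a plant, 1 for an animal) and draws objects with probability proportional to reward, which is the behaviour the paragraph above describes. It skips the step-by-step construction and samples finished objects directly.

```python
import random
from collections import Counter

# Hypothetical rewards for two finished models (illustrative values only).
rewards = {"plant": 3.0, "animal": 1.0}

def target_distribution(rewards):
    """Probability of each object when sampling proportionally to reward."""
    total = sum(rewards.values())
    return {name: reward / total for name, reward in rewards.items()}

def sample_objects(rewards, n, seed=0):
    """Draw n objects with probability proportional to their reward."""
    rng = random.Random(seed)
    names = list(rewards)
    weights = [rewards[name] for name in names]
    return Counter(rng.choices(names, weights=weights, k=n))

probs = target_distribution(rewards)     # plant: 0.75, animal: 0.25
counts = sample_objects(rewards, 10_000)
print(probs, counts)
```

A method that only maximised reward would return the plant every time. Here the animal still appears in roughly a quarter of the draws.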
In this type of search, GFlowNets rely on two stochastic policies that operate together: a forward policy and a backward policy. The forward policy can be thought of as a construction foreman, deciding the next step and estimating the probability of the subsequent state, while the backward policy acts as a deconstruction expert, identifying the preceding step. Maintaining balance between these two policies is crucial but difficult to achieve. First, it requires significant computing power. Second, backward policies lack flexibility: researchers usually prevent them from adapting during the search or from observing the actions of the forward policy.
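What "balance" means can be illustrated on a toy example. The sketch below assumes a hypothetical four-state construction graph, illustrative rewards, and a fixed uniform backward policy, which is the kind of inflexible choice the paragraph above mentions. It then derives the forward policy that balances it and checks the standard trajectory balance condition: the total flow times the forward step probabilities equals the reward times the backward step probabilities along every path. All names and numbers are assumptions made for illustration.

```python
# Hypothetical toy graph: each state maps to the states it can be built from.
PARENTS = {"A": ["s0"], "B": ["s0"], "AB": ["A", "B"], "AA": ["A"]}
REWARDS = {"AB": 3.0, "AA": 1.0}   # illustrative rewards for finished objects
ORDER = ["AB", "AA", "A", "B"]     # every state appears before its parents

def backward_policy(child):
    """Fixed backward policy: every parent of `child` is equally likely."""
    options = PARENTS[child]
    return {parent: 1.0 / len(options) for parent in options}

def state_flows():
    """Push each state's flow back to its parents, starting from the rewards."""
    flow = dict(REWARDS)
    for child in ORDER:
        for parent, prob in backward_policy(child).items():
            flow[parent] = flow.get(parent, 0.0) + flow[child] * prob
    return flow

def forward_policy(flow):
    """Forward step probabilities that balance the backward policy above."""
    policy = {}
    for child in ORDER:
        for parent, prob in backward_policy(child).items():
            policy[(parent, child)] = flow[child] * prob / flow[parent]
    return policy

def balance_gap(trajectory, flow, policy):
    """Forward side minus backward side for one full construction path."""
    forward = flow["s0"]                 # total flow leaving the empty state
    backward = REWARDS[trajectory[-1]]   # reward of the finished object
    for parent, child in zip(trajectory, trajectory[1:]):
        forward *= policy[(parent, child)]
        backward *= backward_policy(child)[parent]
    return forward - backward

flow = state_flows()
policy = forward_policy(flow)
paths = [["s0", "A", "AB"], ["s0", "B", "AB"], ["s0", "A", "AA"]]
gaps = [balance_gap(path, flow, policy) for path in paths]
print(flow["s0"], policy, gaps)
```

On a graph this small the balanced forward policy can be computed exactly. In realistic problems the state space is far too large for that, so the policies are neural networks trained to shrink such gaps.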
HSE scientists have developed a way to optimise backward policies using a method called Trajectory Likelihood Maximisation (TLM). They refined the backward policy’s algorithms so that it can be continuously checked against the steps of the forward policy.
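The idea suggested by the method's name can be sketched as follows. This is a minimal illustration and not the authors' implementation. It assumes a single state with two possible parents, a tabular backward policy parameterised by softmax logits, and a fixed batch of hypothetical forward-policy samples in which the state was reached via parent "A" three times out of four. The backward policy is then nudged to make those observed steps more likely.

```python
import math

def softmax(logits):
    """Turn a dict of logits into a dict of probabilities."""
    peak = max(logits.values())
    exps = {key: math.exp(value - peak) for key, value in logits.items()}
    total = sum(exps.values())
    return {key: value / total for key, value in exps.items()}

def likelihood_step(logits, observed_parents, lr=0.5):
    """One gradient-ascent step on the mean log-likelihood of observed parents."""
    probs = softmax(logits)
    # Gradient of the mean log-softmax: empirical frequency minus probability.
    grad = {parent: -probs[parent] for parent in logits}
    for parent in observed_parents:
        grad[parent] += 1.0 / len(observed_parents)
    return {parent: logits[parent] + lr * grad[parent] for parent in logits}

# Hypothetical batch: parents through which the forward policy reached one state.
observed = ["A", "A", "A", "B"]
logits = {"A": 0.0, "B": 0.0}           # start from a uniform backward policy
for _ in range(200):
    logits = likelihood_step(logits, observed)

learned = softmax(logits)               # moves towards A: 0.75, B: 0.25
print(learned)
```

In actual training the batch would be resampled from the current forward policy at each iteration, and the forward policy would be updated in alternation with the backward policy, so that each adapts to the other.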
'We designed the search for the optimal solution to resemble a negotiation, where both sides are ready to adjust their positions. In highly uncertain problems, the backward policy serves only as an auxiliary tool that improves the results of the forward policy. Our goal was to make the backward policy more flexible, and we finally succeeded,' explains Timofey Gritsaev, co-author of the paper and Research Assistant of the Centre for Deep Learning and Bayesian Methods at the HSE FCS AI and Digital Science Institute.
After implementing TLM, the reward function that measures the backward model’s success became more complex. Despite this increased complexity, the overall search system became faster and more efficient.
'Our method explores the space of possible solutions noticeably faster and identifies more high-quality options. Overall, this approach brings generative models closer to reinforcement learning methods,' explains Nikita Morozov, Junior Research Fellow of the Centre for Deep Learning and Bayesian Methods at the AI and Digital Science Institute of the HSE FCS.
The authors of the study are confident that their work will benefit specialists using GFlowNets across various fields, including the search for new medicinal compounds, the development of materials with specific properties, and the fine-tuning of large language models. Thanks to these networks’ ability to efficiently explore vast solution spaces and quickly identify the best options, the demand on computing power can be significantly reduced.


