
Beauty in Details: HSE University and AIRI Scientists Develop a Method for High-Quality Image Editing

Andy Warhol. Marilyn Diptych, 1962
crossarea.ru/art

Researchers from the HSE AI Research Centre, AIRI, and the University of Bremen have developed StyleFeatureEditor, a new image editing method based on deep learning. The method reproduces even the smallest details of an image and preserves them during editing. With it, users can change hair colour or facial expressions without sacrificing image quality. The results of this three-party collaboration were published at CVPR 2024, a highly cited computer vision conference.

Artificial intelligence can already generate and edit images using generative adversarial networks (GANs). A GAN consists of two networks: a generator that creates images and a discriminator that distinguishes real samples from generated ones. The two networks compete with each other during training. The StyleGAN model marked a new stage in the development of GANs: it can generate images and modify specific parts of them at the user's request. Until now, however, it has not been able to work with real photographs.
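The competition between the two networks can be illustrated with a minimal sketch of a common textbook GAN objective. This is not StyleGAN's actual training recipe, which the article does not describe; the function names and the use of plain probability lists are assumptions made for illustration.

```python
import math
from typing import List


def discriminator_loss(p_real: List[float], p_fake: List[float]) -> float:
    """Binary cross-entropy for the discriminator (illustrative only).

    p_real: discriminator outputs (probability of "real") on real images.
    p_fake: discriminator outputs on generated images.
    The discriminator wants p_real close to 1 and p_fake close to 0.
    """
    real_term = -sum(math.log(p) for p in p_real) / len(p_real)
    fake_term = -sum(math.log(1.0 - p) for p in p_fake) / len(p_fake)
    return real_term + fake_term


def generator_loss(p_fake: List[float]) -> float:
    """Non-saturating generator loss (illustrative only).

    The generator wants the discriminator to rate its images as real,
    i.e. p_fake close to 1.
    """
    return -sum(math.log(p) for p in p_fake) / len(p_fake)
```

When the discriminator is confident and correct, its own loss is low and the generator's loss is high; the generator is then pushed to produce more convincing images, which is the competition described above.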

Researchers from the HSE AI Research Centre, the Artificial Intelligence Research Institute (AIRI), and the University of Bremen have proposed a method for editing real images quickly and efficiently. The approach, StyleFeatureEditor, consists of two modules: the first inverts (reconstructs) the original image, and the second edits this reconstruction. The outputs of both modules are passed to StyleGAN, which generates the edited image from these internal representations. The developers addressed a trade-off encountered in previous research. With a small set of representations, a network could edit the image well but lost some details of the original. With a larger set, all the details were preserved, but the network had difficulty transforming them correctly for the editing task.

To resolve this, the researchers designed the first module to find both the large and the small representations, while the second learns to edit the larger ones using the smaller ones as a reference.
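The data flow through the two modules can be sketched with toy stand-ins. In the actual method both modules are neural networks and the generator is StyleGAN; here every function (`invert`, `edit_small`, `edit_large`, `generate`, `edit_image`) is a hypothetical placeholder operating on short lists of numbers, chosen only to show how the small representation guides the edit of the large one.

```python
from dataclasses import dataclass
from typing import List

Vector = List[float]


@dataclass
class Inversion:
    small: Vector  # compact representation: easy to edit, loses detail
    large: Vector  # high-capacity representation: keeps detail, hard to edit


def invert(image: Vector) -> Inversion:
    """Module 1 (toy stand-in): produce both representations of an image."""
    mean = sum(image) / len(image)
    return Inversion(small=[mean], large=list(image))


def edit_small(small: Vector, strength: float) -> Vector:
    """A simple edit in the compact space: shift along one direction."""
    return [v + strength for v in small]


def edit_large(inv: Inversion, edited_small: Vector) -> Vector:
    """Module 2 (toy stand-in): apply the change observed in the small
    representation to the large one, keeping per-element detail."""
    delta = edited_small[0] - inv.small[0]
    return [v + delta for v in inv.large]


def generate(large: Vector) -> Vector:
    """Placeholder for StyleGAN synthesis from internal representations."""
    return list(large)


def edit_image(image: Vector, strength: float) -> Vector:
    inv = invert(image)
    edited_small = edit_small(inv.small, strength)
    return generate(edit_large(inv, edited_small))
```

In this toy version, `edit_image([0.1, 0.5, 0.9], 0.2)` shifts every value by the same amount, so the differences between elements (the "details") survive the edit. The real second module has to learn such a transformation rather than apply a fixed rule.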

To train these modules to edit the representations accurately, however, the neural network needs both real images and their edited versions.

‘We needed examples, such as the same face with different expressions, hairstyles, and details. Unfortunately, such image pairs do not exist at the moment. So, we came up with a trick: using a method that works with small representations, we created a reconstruction of a real image and an example of editing this reconstruction. Although the examples were relatively simple and without details, the model clearly understood how to make the edits,’ explains Denis Bobkov, one of the authors of the article, a research intern at the Centre of Deep Learning and Bayesian Methods of the AI and Digital Science Institute (part of the HSE Faculty of Computer Science), and a Junior Research Fellow at AIRI’s Fusion Brain Lab.

However, training only on generated (simple) examples leads to a loss of detail when working with real (complex) images. To prevent this, the researchers added real images to the training dataset, and the neural network learnt to reconstruct them in detail.
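The way the training set combines generated pairs with real images can be sketched as follows. This is a sketch under stated assumptions, not the authors' pipeline: `simple_reconstruct` and `simple_edit` are hypothetical stand-ins for the method that works with small representations, and the one-to-one mix of generated and real samples is an assumption, since the article does not say how the two sources are balanced.

```python
import random
from typing import List, Tuple

Image = List[float]
# (input image, edit strength, target image)
Sample = Tuple[Image, float, Image]


def simple_reconstruct(image: Image) -> Image:
    """Hypothetical small-representation method: keeps only the overall
    look (here, the mean value) and drops the details."""
    mean = sum(image) / len(image)
    return [mean] * len(image)


def simple_edit(image: Image, strength: float) -> Image:
    """Hypothetical edit applied to a detail-free reconstruction."""
    return [v + strength for v in image]


def build_training_set(real_images: List[Image], strength: float,
                       seed: int = 0) -> List[Sample]:
    rng = random.Random(seed)
    samples: List[Sample] = []
    for img in real_images:
        recon = simple_reconstruct(img)
        # Generated pair: a simple reconstruction and its edited version,
        # which shows the model how to make the edit.
        samples.append((recon, strength, simple_edit(recon, strength)))
        # Real sample: no edit, and the target is the image itself,
        # so the model must also learn to reproduce fine detail.
        samples.append((list(img), 0.0, list(img)))
    rng.shuffle(samples)
    return samples
```

Each real image contributes one editing example without detail and one reconstruction example with detail, which mirrors the two signals described above.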

Thus, by showing the model how to edit both simple and complex images, the scientists created conditions under which the network could edit complex images more effectively. In particular, the developed approach handles adding new elements of style while preserving the details of the original image better than other existing methods.

Picture 1. Comparison of StyleFeatureEditor (SFE) with other methods on a detailed facial image dataset
© HSE University

In the case of simple reconstruction (first row), StyleFeatureEditor accurately reproduced a hat, while most other methods almost completely lost it. The method also performed best when adding accessories (third row): most methods could add glasses, but only StyleFeatureEditor retained the original eye colour.

‘Thanks to this training technique on generated data, we have obtained a model with high editing quality and a fast processing speed due to the use of relatively lightweight neural networks. The StyleFeatureEditor framework requires only 0.07 seconds to edit a single image,’ says Aibek Alanov, Head of the Centre of Deep Learning and Bayesian Methods of the AI and Digital Science Institute (part of the HSE Faculty of Computer Science), and leader of the research group ‘Controlled Generative AI’ at AIRI's Fusion Brain Lab.

The research was funded by a grant from the Analytical Centre under the Government of the Russian Federation for AI research centres.

The research results will be presented at the Fall into ML 2024 conference on artificial intelligence and machine learning, which will take place at HSE University on October 25–26, 2024. Leading AI scientists will discuss the best papers published at top-tier (A*) flagship AI conferences in 2024. A demo of the developed method can be tried out on HuggingFace, and the source code is available on GitHub.
