Researchers from the University of Zurich (UZH) have used convolutional neural networks (CNNs) to effectively filter noise from detector images and hence make scattering signals measured at PETRA III visible that would otherwise be virtually invisible. By using data from the beamline P21.1, the group around Johan Chang, professor at the Physics institute at UZH, trained a deep CNN system such that a 20-fold reduction of the measurement time yielded equally accurate physical information as the long measurement. In their publication in the recent issue of the journal Nature Machine Intelligence the international team including scientists from DESY showcases the effectiveness of their approach for a single-crystal diffraction experiment and demonstrates its potential for faster data collection for other scattering techniques as well.
In the class of correlated electron materials, the electrons of the outer atomic orbitals interact strongly with each other and influence each other in complex ways. As a result, these materials show a wide variation of macroscopic properties under conditions such as low temperature and high pressure. For instance, the electrical conductivity may change from insulator to superconductor in one and the same compound. It takes looking into very subtle structural features to understand the occurrence and interplay of the different states. The prototypical high-temperature superconductor La1.88Sr0.12CuO4 – investigated in this work – shows a so-called charge-density-wave (CDW) order which is present at low temperatures only. In diffraction images taken at X-ray light sources like PETRA III the effect reveals itself in creating very weak superlattice reflections perpendicular to the strong Bragg reflections. “The modern photon-counting area detectors easily record the strong and the weak diffraction signal in one frame,” says Martin v. Zimmermann, beamline manager at PETRA III beamline P21.1. “However, the noise in the detector images stemming from parasitic scattering of air and the cryostat windows creates similar intensities on the detector as the CDW signal and essentially buries it when we take data for very short times.”
For a more reliable identification of weaker signals and improving the signal-to-noise ratio, the researchers set up two different neural networks with the aim to “de-noise” the measured data. “As a first step, we collected a dataset consisting of pairs of low-count (LC) and high-count (HC) images with 1 and 20 seconds exposure times respectively,” says Julia Küspert, PhD student in the Chang group. Jens Oppliger, first author of the study and also PhD student in the group, further explains the applied procedure: “We then trained a deep CNN to enhance the weak CDW signal in the noisy low-count data in a supervised machine learning scheme using the corresponding high-count images as reference. Training with more than three thousand such pairs of detector frames took around 10 to 20 hours on a moderate GPU with 10 GB of VRAM. Once trained, the network produces de-noised frames within a fraction of a second.” Although the noise distribution is rather well understood for scattering experiments with 2D detectors, it turned out to be important to work with real LC data as it contains additional noise components from various experimental sources such as readout noise. “We realised that adding simulated noise to the HC images resulted in less reliable noise filtering than training with experimental LC data,” says Jens Oppliger.
Accelerated data acquisition does not only facilitate a highly efficient use of valuable beamtime at synchrotron or free-electron-laser beamlines but it also paves the way to ever more advanced studies of radiation-sensitive samples and ultrafast processes. Generally, the reduced data collection time minimises the X-ray dose accumulated in a sample. Wherever long exposures to X-rays destroy the specimen, e.g. biological tissue or metastable materials, the presented noise filter offers the potential to gain insights into delicate structures that have been inaccessible so far. Very importantly, data-denoising assisted by artificial intelligence (AI) enables the detection of very rapid structural changes that happen e.g. in pulsed high magnetic fields or under laser irradiation with a high repetition rate. “Other traditional noise-reduction techniques applied in imaging or acoustics, such as low-pass and Gaussian filters, oftentimes lead to noticeable visual artifacts for images with high noise levels. Our CNN-based noise filter largely avoids such issues while preserving the scientific content of the measurement. That means we do not modify any part of the physical information contained in the detector images and instead utilise the trained AI model to separate signal from noise,” says Jens Oppliger. The introduced method is not based on any specifics of diffraction and scattering data; hence the research team foresees to test the capabilities of the system to extract weak signals from scattering, spectroscopy, and electron microscopy data in a next step. On the long run, a magnitude of scientific communities could benefit from such kind of AI-based tools designed to efficiently remove high levels of noise.
(partly from DESY News)
Reference:
J. Oppliger, M. M. Denner, J. Küspert et al., Weak signal extraction enabled by deep neural network denoising of diffraction data. Nature Machine Intelligence 6, (2024), DOI: 10.1038/s42256-024-00790-1