subgroup discovery
Recently Published Documents


TOTAL DOCUMENTS

197
(FIVE YEARS 42)

H-INDEX

24
(FIVE YEARS 3)

Cancers ◽  
2021 ◽  
Vol 13 (24) ◽  
pp. 6278
Author(s):  
Zainab Al-Taie ◽  
Mark Hannink ◽  
Jonathan Mitchem ◽  
Christos Papageorgiou ◽  
Chi-Ren Shyu

Breast cancer (BC) is the leading cause of death among female patients with cancer. Patients with triple-negative breast cancer (TNBC) have the lowest survival rate. TNBC has substantial heterogeneity within the BC population. This study utilized our novel patient stratification and drug repositioning method to find subgroups of BC patients that share common genetic profiles and that may respond similarly to the recommended drugs. After further examination of the discovered patient subgroups, we identified five homogeneous druggable TNBC subgroups. A drug repositioning algorithm was then applied to find the drugs with a high potential for each subgroup. Most of the top drugs for these subgroups were chemotherapy used for various types of cancer, including BC. After analyzing the biological mechanisms targeted by these drugs, ferroptosis was the common cell death mechanism induced by the top drugs in the subgroups with neoplasm subdivision and race as clinical variables. In contrast, the antioxidative effect on cancer cells was the common targeted mechanism in the subgroup of patients with an age less than 50. Literature reviews were used to validate our findings, which could provide invaluable insights to streamline the drug repositioning process and could be further studied in a wet lab setting and in clinical trials.


2021 ◽  
Author(s):  
Lucas Foppa ◽  
Christopher Sutton ◽  
Luca M. Ghiringhelli ◽  
Sandip De ◽  
Patricia Löser ◽  
...  

The design of heterogeneous catalysts is challenged by the complexity of materials and processes that govern reactivity and by the fact that the number of good catalysts is very small compared to the number of possible materials. Here, we show how the subgroup-discovery (SGD) artificial-intelligence approach can be applied to an experimental plus theoretical data set to identify constraints on key physicochemical parameters, the so-called SG rules, which exclusively describe materials and reaction conditions with outstanding catalytic performance. By using high-hroughput experimentation, 120 SiO 2 -supported catalysts containing ruthenium, tungsten and phosphorus were synthesized and tested in the catalytic oxidation of propylene. As candidate descriptive parameters, the temperature and ten parameters related to the composition and chemical nature of the catalyst materials, derived from calculated free-atom properties, were offered. The temperature, the phosphorus content, and the composition-weighted electronegativity are identified as key parameters describing high yields towards the value-added oxygenate products acrolein and acrylic acid. The SG rules not only reflect the underlying processes particularly associated to high performance but also guide the design of more complex catalysts containing up to five elements in their composition.


2021 ◽  
Author(s):  
Romain Mathonat ◽  
Diana Nurbakova ◽  
Jean-Francois Boulicaut ◽  
Mehdi Kaytoue

Author(s):  
Tatiana Makhalova ◽  
Sergei O. Kuznetsov ◽  
Amedeo Napoli

AbstractPattern mining is well established in data mining research, especially for mining binary datasets. Surprisingly, there is much less work about numerical pattern mining and this research area remains under-explored. In this paper we propose Mint, an efficient MDL-based algorithm for mining numerical datasets. The MDL principle is a robust and reliable framework widely used in pattern mining, and as well in subgroup discovery. In Mint we reuse MDL for discovering useful patterns and returning a set of non-redundant overlapping patterns with well-defined boundaries and covering meaningful groups of objects. Mint is not alone in the category of numerical pattern miners based on MDL. In the experiments presented in the paper we show that Mint outperforms competitors among which IPD, RealKrimp, and Slim.


2021 ◽  
Author(s):  
Christoph Kiefer ◽  
Florian Lemmerich ◽  
Benedikt Langenebrg ◽  
Axel Mayer

Structural equation modeling (SEM) is one of the most popular statistical frameworks in the social and behavioural sciences. Often, detection of groups with distinct sets ofparameters in structural equation models (SEM) are of key importance for appliedresearchers, for example, when investigating differential item functioning for a mentalability test or examining children with exceptional educational trajectories. In this paper, we present a new approach combining subgroup discovery – a well-established toolkit of supervised learning algorithms and techniques from the field of computer science – with structural equation models. We provide an introduction how subgroup discovery can be applied to detect subgroups with exceptional parameter constellations in structural equation models based on user-defined interestingness measures. Furthermore, technical details on the algorithmic components, efficiency, and further computational aspects are presented. Then, our approach is illustrated with two real-world data examples, examining measurement invariance of a mental ability test and investigating interesting subgroups for the mediated relationship between predictors of educational outcomes and the trajectories of math competencies in 5th grade children. The illustrative examples are accompanied bya short introduction in the R package subgroupsem, which is a viable implementation of our approach for applied researchers.


Author(s):  
Lucas Foppa ◽  
Luca M. Ghiringhelli

AbstractIn order to estimate the reactivity of a large number of potentially complex heterogeneous catalysts while searching for novel and more efficient materials, physical as well as data-centric models have been developed for a faster evaluation of adsorption energies compared to first-principles calculations. However, global models designed to describe as many materials as possible might overlook the very few compounds that have the appropriate adsorption properties to be suitable for a given catalytic process. Here, the subgroup-discovery (SGD) local artificial-intelligence approach is used to identify the key descriptive parameters and constrains on their values, the so-called SG rules, which particularly describe transition-metal surfaces with outstanding adsorption properties for the oxygen-reduction and -evolution reactions. We start from a data set of 95 oxygen adsorption-energy values evaluated by density-functional-theory calculations for several monometallic surfaces along with 16 atomic, bulk and surface properties as candidate descriptive parameters. From this data set, SGD identifies constraints on the most relevant parameters describing materials and adsorption sites that (i) result in O adsorption energies within the Sabatier-optimal range required for the oxygen-reduction reaction and (ii) present the largest deviations from the linear-scaling relations between O and OH adsorption energies, which limit the catalyst performance in the oxygen-evolution reaction. The SG rules not only reflect the local underlying physicochemical phenomena that result in the desired adsorption properties, but also guide the challenging design of alloy catalysts.


2021 ◽  
Author(s):  
Aliaksei Mazheika ◽  
Yanggang Wang ◽  
Rosendo Valero ◽  
Francesc Vines ◽  
Francesc Illas ◽  
...  

Abstract Using subgroup discovery, an artificial intelligence (AI) approach that identifies statistically exceptional subgroups in a dataset, we develop a strategy for a rational design of catalytic materials. We identify “materials genes” (features of catalyst materials) that correlate with mechanisms that trigger, facilitate, or hinder the activation of carbon dioxide (CO2) towards a chemical conversion. The approach is used to address the conversion of CO2 to fuels and other useful chemicals. The AI model is trained on high-throughput first-principles data for a broad family of oxides. We demonstrate that bending of the gas-phase linear molecule, previously proposed as the indicator of activation, is insufficient to account for the good catalytic performance of experimentally characterized oxide surfaces. Instead, our AI approach identifies the common feature of these surfaces in the binding of a molecular O atom to a surface cation, which results in a strong elongation and therefore weakening of one molecular C-O bond. The same conclusion is obtained by using the bending indicator only when incombination with the Sabatier principle. Based on these findings, we propose a set of new promising oxide-based catalyst materials for CO2 conversion, and a recipe to find more. Our analysis also reveals advantages of local pattern discovery methods such as subgroup discovery over standard global regression approaches in discovering combinations of materials properties that result in a catalytic activation.


Sign in / Sign up

Export Citation Format

Share Document