scholarly journals Online summarization of dynamic graphs using subjective interestingness for sequential data

Author(s):  
Sarang Kapoor ◽  
Dhish Kumar Saxena ◽  
Matthijs van Leeuwen

Abstract Many real-world phenomena can be represented as dynamic graphs, i.e., networks that change over time. The problem of dynamic graph summarization, i.e., to succinctly describe the evolution of a dynamic graph, has been widely studied. Existing methods typically use objective measures to find fixed structures such as cliques, stars, and cores. Most of the methods, however, do not consider the problem of online summarization, where the summary is incrementally conveyed to the analyst as the graph evolves, and (thus) do not take into account the knowledge of the analyst at a specific moment in time. We address this gap in the literature through a novel, generic framework for subjective interestingness for sequential data. Specifically, we iteratively identify atomic changes, called ‘actions’, that provide most information relative to the current knowledge of the analyst. For this, we introduce a novel information gain measure, which is motivated by the minimum description length (MDL) principle. With this measure, our approach discovers compact summaries without having to decide on the number of patterns. As such, we are the first to combine approaches for data mining based on subjective interestingness (using the maximum entropy principle) with pattern-based summarization (using the MDL principle). We instantiate this framework for dynamic graphs and dense subgraph patterns, and present DSSG, a heuristic algorithm for the online summarization of dynamic graphs by means of informative actions, each of which represents an interpretable change to the connectivity structure of the graph. The experiments on real-world data demonstrate that our approach effectively discovers informative summaries. We conclude with a case study on data from an airline network to show its potential for real-world applications.

2020 ◽  
Vol 7 (Supplement_1) ◽  
pp. S701-S701
Author(s):  
Anne M Butler ◽  
Jason Newland ◽  
John Sahrmann ◽  
Caroline O’Neil ◽  
Sena Sayood ◽  
...  

Abstract Background Vaccine hesitancy is increasingly common, but more information is needed on patterns of childhood vaccination. We characterized patterns of vaccine delay among commercially-insured children in the U.S. Methods Using the IBM MarketScan Commercial Database, we identified infants who received a timely first dose of diphtheria-tetanus-acellular pertussis (DTaP) vaccine from October 2009 to June 2017. We used CPT codes to collect vaccine administration history on antigen, formulation, dose, and date. We ascertained injectable and oral vaccine antigens (DTaP, polio, pneumococcal conjugate, rotavirus, Haemophilus influenza type b (Hib), measles, mumps, rubella, varicella). Timely receipt was defined as concomitant administration of the CDC-recommended number of antigens during the following time windows: 2, 4, 6, and 12-15 months of age (grace period: -7, +21 days). We generated heat maps to illustrate age distributions at receipt of specific antigens and doses. We created Sankey diagrams to illustrate the number of antigens received concomitantly during each time window and depict transitions to different states over time (e.g., no vaccine delay to vaccine delay). For each antigen and dose, we estimated the cumulative incidence of receipt. Results Among 1,066,216 eligible infants, the majority (84%) concomitantly received all 5 CDC-recommended antigens at 2 months of age while others only received 1 (1%), 2 (2%), 3 (4%) or 4 (9%) antigens. Many vaccinations were delayed – 30% and 39% of children did not receive all recommended antigens concomitantly at 4 and 6 months, respectively. The heat map shows wide variation in age at vaccination. For several antigens including Hib, measles, mumps, rotavirus, rubella, and varicella, the cumulative incidence increased steeply at ≥2 time points, suggesting vaccine delay for some infants (e.g., the first dose of Hib was administered to 85% of infants by 2 months of age, with subsequent small but distinct increases at 4, 6, 12, and 15 months of age). Conclusion Using real-world data to study early childhood vaccination patterns, we observed evidence of substantial deviation from the CDC-recommended schedule. These results expand current knowledge on the magnitude and timing of antigen- and dose-specific vaccine delay on a population level. Disclosures Jason Newland, MD, MEd, FPIDS, Merck (Grant/Research Support)Pfizer (Other Financial or Material Support, Industry funded clinical trial) Leah McGrath, PhD, NoviSci, Inc. (Employee)


2020 ◽  
Vol 34 (04) ◽  
pp. 5101-5108
Author(s):  
Xiao Ma ◽  
Peter Karkus ◽  
David Hsu ◽  
Wee Sun Lee

Recurrent neural networks (RNNs) have been extraordinarily successful for prediction with sequential data. To tackle highly variable and multi-modal real-world data, we introduce Particle Filter Recurrent Neural Networks (PF-RNNs), a new RNN family that explicitly models uncertainty in its internal structure: while an RNN relies on a long, deterministic latent state vector, a PF-RNN maintains a latent state distribution, approximated as a set of particles. For effective learning, we provide a fully differentiable particle filter algorithm that updates the PF-RNN latent state distribution according to the Bayes rule. Experiments demonstrate that the proposed PF-RNNs outperform the corresponding standard gated RNNs on a synthetic robot localization dataset and 10 real-world sequence prediction datasets for text classification, stock price prediction, etc.


2021 ◽  
Author(s):  
Asher Wasserman ◽  
Al Musella ◽  
Mark Shapiro ◽  
Jeff Shrager

Randomized controlled trials (RCTs) offer a clear causal interpretation of treatment effects, but are inefficient in terms of information gain per patient. Moreover, because they are intended to test cohort-level effects, RCTs rarely provide information to support precision medicine, which strives to choose the best treatment for an individual patient. If causal information could be efficiently extracted from widely available real-world data, the rapidity of treatment validation could be increased, and its costs reduced. Moreover, inferences could be made across larger, more diverse patient populations. We created a "virtual trial" by fitting a multilevel Bayesian survival model to treatment and outcome records self-reported by 451 brain cancer patients. The model recovers group-level treatment effects comparable to RCTs representing over 3200 patients. The model additionally discovers the feature-treatment interactions needed to make individual-level predictions for precision medicine. By learning causally-valid information from heterogeneous real-world data, without artificially-limiting the patient population, virtual trials can generate more information from fewer patients, demonstrating their value as a complement to large randomized controlled trials, especially in highly heterogeneous or rare diseases.


2016 ◽  
Vol 116 (S 02) ◽  
pp. S13-S23 ◽  
Author(s):  
A. Camm ◽  
Craig Coleman ◽  
CAPT Tamayo ◽  
Jan Beyer-Westendorf

SummaryRandomised controlled trials (RCTs) are considered the gold standard of clinical research as they use rigorous methodologies, detailed protocols, pre-specified statistical analyses and well-defined patient cohorts. However, RCTs do not take into account the complexity of real-world clinical decision-making. To tackle this, real-world data are being increasingly used to evaluate the long-term safety and effectiveness of a given therapy in routine clinical practice and in patients who may not be represented in RCTs, addressing key clinical questions that may remain. Real-world evidence plays a substantial role in supporting the use of non-vitamin K antagonist (VKA) oral anticoagulants (NOACs) in clinical practice. By providing data on patient profiles and the use of anticoagulation therapies in routine clinical practice, real-world evidence expands the current awareness of NOACs, helping to ensure that clinicians are well-informed on their use to implement patient-tailored clinical decisions. There are various issues with current anticoagulation strategies, including under- or overtreatment and frequent monitoring with VKAs. Real-world studies have demonstrated that NOAC use is increasing (Dresden NOAC registry and Global Anticoagulant Registry in the FIELD-AF [GARFIELD-AF]), as well as reaffirming the safety and effectiveness of rivaroxaban previously observed in RCTs (XArelto on preveNtion of sTroke and non-central nervoUS system systemic embolism in patients with non-valvular atrial fibrillation [XANTUS] and IMS Disease Analyzer). This article will describe the latest updates in real-world evidence across a variety of methodologies, such as non-interventional studies (NIS), registries and database analyses studies. It is anticipated that these studies will provide valuable clinical insights into the management of thromboembolism, and enhance the current knowledge on anticoagulant use and outcomes for patients.


2016 ◽  
Vol 22 ◽  
pp. 219
Author(s):  
Roberto Salvatori ◽  
Olga Gambetti ◽  
Whitney Woodmansee ◽  
David Cox ◽  
Beloo Mirakhur ◽  
...  

VASA ◽  
2019 ◽  
Vol 48 (2) ◽  
pp. 134-147 ◽  
Author(s):  
Mirko Hirschl ◽  
Michael Kundi

Abstract. Background: In randomized controlled trials (RCTs) direct acting oral anticoagulants (DOACs) showed a superior risk-benefit profile in comparison to vitamin K antagonists (VKAs) for patients with nonvalvular atrial fibrillation. Patients enrolled in such studies do not necessarily reflect the whole target population treated in real-world practice. Materials and methods: By a systematic literature search, 88 studies including 3,351,628 patients providing over 2.9 million patient-years of follow-up were identified. Hazard ratios and event-rates for the main efficacy and safety outcomes were extracted and the results for DOACs and VKAs combined by network meta-analysis. In addition, meta-regression was performed to identify factors responsible for heterogeneity across studies. Results: For stroke and systemic embolism as well as for major bleeding and intracranial bleeding real-world studies gave virtually the same result as RCTs with higher efficacy and lower major bleeding risk (for dabigatran and apixaban) and lower risk of intracranial bleeding (all DOACs) compared to VKAs. Results for gastrointestinal bleeding were consistently better for DOACs and hazard ratios of myocardial infarction were significantly lower in real-world for dabigatran and apixaban compared to RCTs. By a ranking analysis we found that apixaban is the safest anticoagulant drug, while rivaroxaban closely followed by dabigatran are the most efficacious. Risk of bias and heterogeneity was assessed and had little impact on the overall results. Analysis of effect modification could guide the clinical decision as no single DOAC was superior/inferior to the others under all conditions. Conclusions: DOACs were at least as efficacious as VKAs. In terms of safety endpoints, DOACs performed better under real-world conditions than in RCTs. The current real-world data showed that differences in efficacy and safety, despite generally low event rates, exist between DOACs. Knowledge about these differences in performance can contribute to a more personalized medicine.


Sign in / Sign up

Export Citation Format

Share Document