VM-NSP

2021 ◽  
Vol 39 (2) ◽  
pp. 1-27
Author(s):  
Wei Wang ◽  
Longbing Cao

Negative sequential patterns (NSPs) capture more informative and actionable knowledge than classic positive sequential patterns (PSPs) due to the involvement of both occurring and nonoccurring behaviors and events, which can contribute to many relevant applications. However, NSP mining is nontrivial, as it involves fundamental challenges requiring distinct theoretical foundations and is not directly addressable by PSP mining. In the very limited research reported on NSP mining, a negative element constraint (NEC) is incorporated to only consider the NSPs composed of specific forms of elements (containing either positive or negative items), which results in many valuable NSPs being missed. Here, we loosen the NEC (called loose negative element constraint (LNEC)) to include partial negative elements containing both positive and negative items, which enables the discovery of more flexible patterns but incorporates significant new learning challenges, such as representing and mining complete NSPs. Accordingly, we formalize the LNEC-based NSP mining problem and propose a novel vertical NSP mining framework , VM-NSP, to efficiently mine the complete set of NSPs by a vertical representation (VR) of each sequence. An efficient bitmap-based vertical NSP mining algorithm , bM-NSP, introduces a bitmap hash table--based VR and a prefix-based negative sequential candidate generation strategy to optimize the discovery performance. VM-NSP and its implementation bM-NSP form the first VR-based approach for complete NSP mining with LNEC. Theoretical analyses and experiments confirm the performance superiority of bM-NSP on synthetic and real-life datasets w.r.t. diverse data factors, which substantially expands existing NSP mining methods toward flexible NSP discovery.

Author(s):  
Tiantian Xu ◽  
Xiangjun Dong ◽  
Jianliang Xu ◽  
Xue Dong

High utility sequential patterns (HUSP) refer to those sequential patterns with high utility (such as profit), which play a crucial role in many real-life applications. Relevant studies of HUSP only consider positive values of sequence utility. In some applications, however, a sequence consists of items with negative values (NIV). For example, a supermarket sells a cartridge with negative profit in a package with a printer at higher positive return. Although a few methods have been proposed to mine high utility itemsets (HUI) with NIV, they are not suitable for mining HUSP with NIV because an item may occur more than once in a sequence and its utility may have multiple values. In this paper, we propose a novel method High Utility Sequential Patterns with Negative Item Values (HUSP-NIV) to efficiently mine HUSP with NIV from sequential utility-based databases. HUSP-NIV works as follows: (1) using the lexicographic quantitative sequence tree (LQS-tree) to extract the complete set of high utility sequences and using I-Concatenation and S-Concatenation mechanisms to generate newly concatenated sequences; (2) using three pruning methods to reduce the search space in the LQS-tree; (3) traversing LQS-tree and outputting all the high utility sequential patterns. To the best of our knowledge, HUSP-NIV is the first method to mine HUSP with NIV, which is shown efficient on both synthetic and real datasets.


Author(s):  
Tiantian Xu ◽  
Jianliang Xu ◽  
Xiangjun Dong

High utility sequential patterns (HUSP) mining has recently received a lot of attention from researchers. Many algorithms have been proposed to mine HUSP and most of them only use a single minimum utility, which implicitly assumes that all items in the database are of the same importance (such as profit), or other information based on users’ concern in the database. This is often not the case in real-life applications. Although a few methods have been proposed to mine high utility itemsets (HUI) with multiple minimum utility (MMU), they are not suitable for mining HUSP with MMU because an item may occur more than one time in a sequence and may have multiple utility values. In this paper, we propose a novel method, called HUSpan-MMU, to efficiently mine HUSP with MMU from sequential utility-based databases. A lexicographic quantitative sequence tree (LQS-tree) is used to extract the complete set of HUSP. Meanwhile, two pruning methods are used to reduce the search space in the LQS-tree. Experimental results on both synthetic and real datasets show that HUSpan-MMU can efficiently mine HUSP with MMU from utility-based databases.


2016 ◽  
Vol 3 (2) ◽  
pp. 82-89
Author(s):  
Humberto López Castillo ◽  
Elizabeth A. Lockhart ◽  
Alison B. Oberne ◽  
Ellen M. Daley

Teaching sexual and reproductive health in general, and contraceptives in particular, presents instructors with diverse challenges. While instructors need to cover textbook concepts, the classroom setting does not offer much context for a significant, experiential learning opportunity. We have developed and implemented a Contraceptives Scavenger Hunt assignment, designed to facilitate experiential learning and put class concepts into real-life context. Students were provided with three groups of sexual and reproductive health items that were discussed in class (contraceptives for males, contraceptives for females, and other interesting items). They had to choose one item from each group and hunt for it in local stores, pharmacies, sex shops, and so on. They reported on their overall experience and identified barriers they would not have thought of in class (e.g., transportation, cost, ease of access). Variations to this activity, its implications for experiential learning, challenges to its implementation, and its impact on student learning outcomes were discussed.


2018 ◽  
Vol 32 (19) ◽  
pp. 1850209 ◽  
Author(s):  
Yu-Qing Wang ◽  
Chao-Fan Zhou ◽  
Zi-Ang Zhu ◽  
Jia-Wei Wang ◽  
Zi-Meng Wang ◽  
...  

Based on tremendous real data, a macroscopic model is established, which can depict the process of volatile organic compound (namely, VOC) emissions. Different from previous work, a complete set of sources is taken into account rather than only an isolated source. These data have been processed to support the sample set in order to prove the validity of the theoretical analyses. Besides, the relationship between the industrial production and VOC emissions of industrial source is discussed and depicted. Furthermore, the relationship between the electronic industrial production and VOC emissions is emphasized and calculated. VOC emissions per unit production is investigated. Additionally, the relationship between the number of sample points in the sample set and VOC emissions is illustrated. Then, the control strategy of VOC emissions is proposed by calculating the optimal solutions of each sample set. It is found that the lower the slope of optimal solutions, the lower the average VOC emissions, the better the VOC emissions control effect.


Sequential pattern mining is one of the important functionalities of data mining. It is used for analyzing sequential database and discovers sequential patterns. It is focused for extracting interesting subsequences from a set of sequences. Various factors such as rate of occurrence, length, and profit are used to define the interestingness of subsequence derived from the sequence database. Sequential pattern mining has abundant real-life applications since sequential data is logically programmed as sequences of cipher in many fields such as bioinformatics, e-learning, market basket analysis, texts, and webpage click-stream analysis. A large diversity of competent algorithms such as Prefixspan, GSP and Freespan have been proposed during the past few years. In this paper we propose a data model for organizing the sequential database, which consists of a directed graph DGS (cycles and several edges are allowed) and an organization of directed paths in DGS to represent a sequential data for discovering sequential pattern3 from a sequence database. Competent algorithms for constructing the digraph model (DGS) for extracting all sequential patterns and mining association rules are proposed. A number of theoretical parameters of digraph model are also introduced, which lead to more understanding of the problem.


The problem of mining sequential patterns from medical data has received a lot of attention as it aims to discover the causal relationship between different diseases or symptoms that are present in the patient’s body. Medical data contains the records pertaining to the information of the diseases or the symptoms of the patients besides the patients’ personal information. The records are ordered in accordance with the time and date of the patients’ visit in the hospital. Such data may offer precious information related to the cause and effect of a disease on the human body. Although, the date and time gives us chronological ordering of the occurrence of the diseases in the human body, it does not provide the information about the time intervals within which the successive diseases may occur. If the time gap of cause and effect is found to be too large, the concerned sequential pattern would be un-realistic. Considering, the time attributes of medical data, we try to address the above-mentioned problem on the sequential patterns. In this paper, we propose a method of extracting sequential patterns from medical dataset, with time-restrictions. The method extracts all sequences of diseases which occur within user-specified time intervals. The efficacy of our method is established with an experiment conducted on real life medical datasets


Author(s):  
Sherlyane Hendri ◽  
Ary Kiswanto Kenedi

Students’ problem solving ability at any school is still in low category. That is due to student weakness to understand any mathematical problems that linked to the real life around them. One of any cause of this problem is  mathematics lesson equipments development and its utilization by teacher are not optimal yet. To solve this problem, mathematics teachers should develop mathematics lesson equipments based on discovery learning for class V student at Elementary school that sutisfied validity, practicality, and efectiveness criteria. This is a development research with 4D models consisting of Define, Design, Develop, and Disseminate. In preliminary research for Define stage, researcher takes analysis of needs, analysis of curiculum, and analysis of mathematic concepts. Techniques of data collection through interview with teacher mathematics, questionnaire learners, curriculum analysis. Based on the analysis found that the needs of learners in the form of new learning model in addition to conventional learning are better able to increase problem solving ability of able to improve students' problem solving abilities to achieve desired goals and then designed a mathematics teaching material based on discovery learning.Keywords : Problem solving ability, Discovery Learning, 4-D model


Author(s):  
Laor Boongasame ◽  
Dickson K. W. Chiu

Coalition stability is a major requirement in coalition formation. One important problem to achieve stability in n-person game theories is the assumption that the preference of each buyer is publicly known. The coalition is said to be stable if there are no objection by any subset of buyers according to their publicly known preferences. However, such assumption is often unrealistic in typical real-life situations. Individual buyers often have private preferences and make their decisions according to their own preferences instead. This study proposes a novel preference coalition formation scheme for buyer coalition services that attempts to consider private preference of individual buyers within the buyer coalition process. The theoretical foundations of the study are rooted in the fields of multi-criteria decision making, human practical reasoning, and n-person game theories, from which we design an appropriate scheme for our proposed buyer coalition framework with emphasis on private preferences of individual buyers. The authors validate their proposed scheme with simulation software developed to demonstrate results of a variety of practical situations.


Author(s):  
Kang-Yu He ◽  
Xiao-Xu Yuan

Abstract In this paper, we presented a mew mechanical model of an impact damper based on the theoretical analyses and experiments. We also presented a method of identifying the parameters of the non∼linear method of an impact system by means of microcomputer calculations. The method of identification presented in this paper has been proved byour experiments. These works supply the theoretical foundations of designing and calculating for impact damper.


Author(s):  
Tiantian Xu ◽  
Xiangjun Dong ◽  
Jianliang Xu ◽  
Yongshun Gong

Mining negative sequential patterns (NSP) has been an important research area in data mining and knowledge discovery and it is much more challenging than mining positive sequential patterns (PSP) due to the computational complexity and search space. Only a few methods have been proposed to mine NSP and most of them only use single minimum support, which implicitly assumes that all items in the database are of the same nature or of similar frequencies in the database. This is often not the case in real-life applications. There are several methods to mine sequential patterns with multiple minimum supports (MMS), but these methods only consider PSP and do not handle NSP. So in this paper, we propose a new method, called e-msNSP, to mine NSP with multiple minimum supports. We also solve the problem of how to set up the minimum support to a sequence with negative item(s). E-msNSP consists of three major steps: (i) using the improved MS-GSP method to mine PSP with multiple minimum supports and storing all positive sequential candidates’ (PSC) related information simultaneously; (ii) using the same method in e-NSP to generate negative sequential candidates (NSC) based on above mined PSP; (iii) calculating the support of these NSC based only on the corresponding PSP and then getting NSP. To the best of our knowledge, e-msNSP is the first method to mine NSP with MMS and does not impose strict constraints. Experimental results show that the e-msNSP is highly effective and efficient.


Sign in / Sign up

Export Citation Format

Share Document