Clustering Techniques for Rule Extraction from Unstructured Text Fragments

Manufacturing companies maintain manufacturing knowledge primarily as unstructured text. To facilitate formal use of such knowledge, previous efforts have utilized natural language processing (NLP) to classify manufacturing documents or extract manufacturing concepts/relations. However, extracting more complex knowledge, such as manufacturing rules, has been evasive due to the lack of methods to resolve ambiguities. Specifically, standard NLP techniques do not address domain-specific ambiguities that are due to manufacturing-specific meanings implicit in the text. To address this important gap, we propose an ambiguity resolution method that utilizes domain ontology as the mechanism to incorporate the domain context. We demonstrate its feasibility by extending our previously implemented manufacturing rule extraction framework. The effectiveness of the method is demonstrated by resolving all the domain-specific ambiguities in the dataset and an improvement in correct detection of rules to 70% (increased by about 13%). We expect that this work will contribute to the adoption of semantics-based technology in manufacturing field, by enabling the extraction of precise formal knowledge from text.

Download Full-text

Clustering techniques for thyroid nodules malignancy inference in the era of personalized medicine

Endocrine Abstracts ◽

10.1530/endoabs.70.ep445 ◽

2020 ◽

Author(s):

Andrea Giani ◽

de Souza Patricia Borges ◽

Stefania Bartoletti ◽

Flavio Morselli ◽

Andrea Conti ◽

...

Keyword(s):

Personalized Medicine ◽

Thyroid Nodules ◽

Clustering Techniques

Download Full-text

A Survival Study on Data Structure Based Clustering Techniques for Multidimensional Data Stream Analysis

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v5i12.101108 ◽

2017 ◽

Vol 5 (12) ◽

pp. 101-108

Author(s):

K. Chitra ◽

◽

D. Maheswari

Keyword(s):

Data Structure ◽

Data Stream ◽

Multidimensional Data ◽

Clustering Techniques ◽

Survival Study ◽

Data Stream Analysis

Download Full-text

A State of Art Approaches on Energy Efficient Clustering Techniques in WSN

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v7i3.5054 ◽

2019 ◽

Vol 7 (3) ◽

pp. 50-54

Author(s):

N. Thilagavathi ◽

Christy Wood ◽

V. Hemalakshumi ◽

V. Mathumiithaa

Keyword(s):

Energy Efficient ◽

Clustering Techniques ◽

Energy Efficient Clustering ◽

State Of Art

Download Full-text

Examination of Clustering Techniques using Genetic Algorithm

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i4.374378 ◽

2018 ◽

Vol 6 (4) ◽

pp. 374-378

Author(s):

S. Ramya ◽

◽

N. Subha

Keyword(s):

Genetic Algorithm ◽

Clustering Techniques

Download Full-text

Systematic Defect Identification through Layout Snippet Clustering

ISTFA 2010: Conference Proceedings from the 36th International Symposium for Testing and Failure Analysis ◽

10.31399/asm.cp.istfa2010p0320 ◽

2010 ◽

Author(s):

Wing Chiu Tam ◽

Osei Poku ◽

R. D. (Shawn) Blanton

Keyword(s):

Design Process ◽

Integrated Circuit ◽

Yield Loss ◽

Defect Identification ◽

Clustering Techniques ◽

Dominant Component

Abstract Systematic defects due to design-process interactions are a dominant component of integrated circuit (IC) yield loss in nano-scaled technologies. Test structures do not adequately represent the product in terms of feature diversity and feature volume, and therefore are unable to identify all the systematic defects that affect the product. This paper describes a method that uses diagnosis to identify layout features that do not yield as expected. Specifically, clustering techniques are applied to layout snippets of diagnosis-implicated regions from (ideally) a statistically-significant number of IC failures for identifying feature commonalties. Experiments involving an industrial chip demonstrate the identification of possible systematic yield loss due to lithographic hotspots.

Download Full-text

Estimation and Analysis of Heart Disease using Novel Clustering Techniques

International Journal of Pharmaceutical Research ◽

10.31838/ijpr/2020.sp2.438 ◽

2020 ◽

Vol 12 (sp2) ◽

Keyword(s):

Heart Disease ◽

Clustering Techniques

Download Full-text

Analyzing the difference between deep learning and machine learning features of EEG signal using clustering techniques

2019 IEEE Region 10 Symposium (TENSYMP) ◽

10.1109/tensymp46218.2019.8971358 ◽

2019 ◽

Author(s):

Anushri Saha ◽

Sachin Singh Rathore ◽

Shivam Sharma ◽

Debasis Samanta

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Eeg Signal ◽

Clustering Techniques ◽

The Difference

Download Full-text

An algorithm to derive a numerical daily dose from unstructured text dosage instructions

Pharmacoepidemiology and Drug Safety ◽

10.1002/pds.1151 ◽

2006 ◽

Vol 15 (3) ◽

pp. 161-166 ◽

Cited By ~ 19

Author(s):

Anoop D. Shah ◽

Carlos Martinez

Keyword(s):

Unstructured Text ◽

Daily Dose

Download Full-text

Clustering Techniques for Secondary Substations Siting

Energies ◽

10.3390/en14041028 ◽

2021 ◽

Vol 14 (4) ◽

pp. 1028

Author(s):

Silvia Corigliano ◽

Federico Rosato ◽

Carla Ortiz Dominguez ◽

Marco Merlo

Keyword(s):

Rural Areas ◽

Urban Areas ◽

Universal Access ◽

Distribution Networks ◽

Industrialized Countries ◽

Agglomerative Clustering ◽

Clustering Techniques ◽

Hierarchical Agglomerative Clustering ◽

Efficient Planning ◽

Target Set

The scientific community is active in developing new models and methods to help reach the ambitious target set by UN SDGs7: universal access to electricity by 2030. Efficient planning of distribution networks is a complex and multivariate task, which is usually split into multiple subproblems to reduce the number of variables. The present work addresses the problem of optimal secondary substation siting, by means of different clustering techniques. In contrast with the majority of approaches found in the literature, which are devoted to the planning of MV grids in already electrified urban areas, this work focuses on greenfield planning in rural areas. K-means algorithm, hierarchical agglomerative clustering, and a method based on optimal weighted tree partitioning are adapted to the problem and run on two real case studies, with different population densities. The algorithms are compared in terms of different indicators useful to assess the feasibility of the solutions found. The algorithms have proven to be effective in addressing some of the crucial aspects of substations siting and to constitute relevant improvements to the classic K-means approach found in the literature. However, it is found that it is very challenging to conjugate an acceptable geographical span of the area served by a single substation with a substation power high enough to justify the installation when the load density is very low. In other words, well known standards adopted in industrialized countries do not fit with developing countries’ requirements.

Download Full-text

Clustering Techniques for Rule Extraction from Unstructured Text Fragments

Ontology-Based Ambiguity Resolution of Manufacturing Text for Formal Rule Extraction

Clustering techniques for thyroid nodules malignancy inference in the era of personalized medicine

A Survival Study on Data Structure Based Clustering Techniques for Multidimensional Data Stream Analysis

A State of Art Approaches on Energy Efficient Clustering Techniques in WSN

Examination of Clustering Techniques using Genetic Algorithm

Systematic Defect Identification through Layout Snippet Clustering

Estimation and Analysis of Heart Disease using Novel Clustering Techniques

Analyzing the difference between deep learning and machine learning features of EEG signal using clustering techniques

An algorithm to derive a numerical daily dose from unstructured text dosage instructions

Clustering Techniques for Secondary Substations Siting

Export Citation Format