A Recurrent Model for Collective Entity Linking with Adaptive Features

The vast amount of web data enables us to build knowledge bases with unprecedented quality and coverage. Named Entity Disambiguation (NED) is an important task that automatically resolves ambiguous mentions in free text to correct target entries in the knowledge base. Traditional machine learning based methods for NED were outperformed and made obsolete by the state-of-the-art deep learning based models. However, deep learning models are more complex, requiring large amount of training data and lengthy training and parameter tuning time. In this paper, we revisit traditional machine learning techniques and propose a light-weight, tuneable and time-efficient method without using deep learning or deep learning generated features. We propose novel adaptive features that focus on extracting discriminative features to better model similarities between candidate entities and the mention's context. We learn a local ranking model based on traditional and the new adaptive features based on the learning-to-rank framework. While arriving at linking decisions individually via the local model, our method also takes into consideration the correlation between decisions by running multiple recurrent global models, which can be deemed as a learned local search method. Our method attains performances comparable to the state-of-the-art deep learning-based methods on NED benchmark datasets while being significantly faster to train.

Download Full-text

Damage Detection Method for Buildings with Machine-Learning Techniques Utilizing Images of Automobile Running Surveys Aftermath of the 2016 Kumamoto Earthquake

Journal of Disaster Research ◽

10.20965/jdr.2018.p0928 ◽

2018 ◽

Vol 13 (5) ◽

pp. 928-942

Author(s):

Shohei Naito ◽

Hiromitsu Tomozawa ◽

Yuji Mori ◽

Hiromitsu Nakamura ◽

Hiroyuki Fujiwara ◽

...

Keyword(s):

Machine Learning ◽

Parameter Tuning ◽

Learning Phase ◽

Training Data ◽

Machine Learning Techniques ◽

Support Vector ◽

Kumamoto Earthquake ◽

Class Separation ◽

Discrimination Phase ◽

Patch Image

In order to understand the damage situation immediately after the occurrence of a disaster and to support disaster response, we developed a method to classify the degree of building damage in three stages with machine-learning using road-running survey images obtained immediately after the Kumamoto earthquake. Machine-learning involves a learning phase and a discrimination phase. As training data, we used images from a camera installed in the travel direction of an automobile, in which the degree of damage was visually categorized. In the learning phase, class separation is carried out by support vector machine (SVM) on a basis of a feature calculated from training patch images for each extracted damage category. In the discrimination phase, input images are provided with raster scan so that the class separation is carried out in units of the patch image. In this manner, learning, discrimination, and parameter tuning are repeated. By doing so, we developed a damage-discrimination model for each patch image and validated the discrimination accuracy using a cross-validation method. Furthermore, we developed a method using an optical flow for preventing double counting of damaged areas in cases where an identical building is captured in multiple photos.

Download Full-text

Automatic microseismic event picking via unsupervised machine learning

Geophysical Journal International ◽

10.1093/gji/ggaa186 ◽

2020 ◽

Vol 222 (3) ◽

pp. 1750-1764 ◽

Cited By ~ 1

Author(s):

Yangkang Chen

Keyword(s):

Machine Learning ◽

Clustering Algorithm ◽

Learning Algorithm ◽

State Of The Art ◽

The State ◽

Training Data ◽

Supervised Machine Learning ◽

Machine Learning Algorithm ◽

Unsupervised Machine Learning ◽

Earthquake Data

SUMMARY Effective and efficient arrival picking plays an important role in microseismic and earthquake data processing and imaging. Widely used short-term-average long-term-average ratio (STA/LTA) based arrival picking algorithms suffer from the sensitivity to moderate-to-strong random ambient noise. To make the state-of-the-art arrival picking approaches effective, microseismic data need to be first pre-processed, for example, removing sufficient amount of noise, and second analysed by arrival pickers. To conquer the noise issue in arrival picking for weak microseismic or earthquake event, I leverage the machine learning techniques to help recognizing seismic waveforms in microseismic or earthquake data. Because of the dependency of supervised machine learning algorithm on large volume of well-designed training data, I utilize an unsupervised machine learning algorithm to help cluster the time samples into two groups, that is, waveform points and non-waveform points. The fuzzy clustering algorithm has been demonstrated to be effective for such purpose. A group of synthetic, real microseismic and earthquake data sets with different levels of complexity show that the proposed method is much more robust than the state-of-the-art STA/LTA method in picking microseismic events, even in the case of moderately strong background noise.

Download Full-text

Application of Machine Learning in Rhinology: A State of the Art Review

Korean Journal of Otorhinolaryngology - Head and Neck Surgery ◽

10.3342/kjorl-hns.2020.00633 ◽

2020 ◽

Vol 63 (8) ◽

pp. 341-349

Author(s):

Myeong Sang Yu

Keyword(s):

Machine Learning ◽

Big Data ◽

Deep Learning ◽

Medical Records ◽

State Of The Art ◽

Machine Learning Techniques ◽

Machine Learning Technique ◽

Learning Techniques ◽

Learning Technique ◽

Machine Learning Applications

The revolutionary development of artificial intelligence (AI) such as machine learning and deep learning have been one of the most important technology in many parts of industry, and also enhance huge changes in health care. The big data obtained from electrical medical records and digitalized images accelerated the application of AI technologies in medical fields. Machine learning techniques can deal with the complexity of big data which is difficult to apply traditional statistics. Recently, the deep learning techniques including convolutional neural network have been considered as a promising machine learning technique in medical imaging applications. In the era of precision medicine, otolaryngologists need to understand the potentialities, pitfalls and limitations of AI technology, and try to find opportunities to collaborate with data scientists. This article briefly introduce the basic concepts of machine learning and its techniques, and reviewed the current works on machine learning applications in the field of otolaryngology and rhinology.

Download Full-text

Protein sequence design with a learned potential

10.1101/2020.01.06.895466 ◽

2020 ◽

Cited By ~ 5

Author(s):

Namrata Anand-Achim ◽

Raphael R. Eguchi ◽

Alexander Derry ◽

Russ B. Altman ◽

Po-Ssu Huang

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Amino Acid ◽

Protein Design ◽

Protein Sequence ◽

State Of The Art ◽

The State ◽

Energy Functions ◽

Sequence Design

AbstractThe primary challenge of fixed-backbone protein design is to find a distribution of sequences that fold to the backbone of interest. This task is central to nearly all protein engineering problems, as achieving a particular backbone conformation is often a prerequisite for hosting specific functions. In this study, we investigate the capability of a deep neural network to learn the requisite patterns needed to design sequences. The trained model serves as a potential function defined over the space of amino acid identities and rotamer states, conditioned on the local chemical environment at each residue. While most deep learning based methods for sequence design only produce amino acid sequences, our method generates full-atom structural models, which can be evaluated using established sequence quality metrics. Under these metrics we are able to produce realistic and variable designs with quality comparable to the state-of-the-art. Additionally, we experimentally test designs for a de novo TIM-barrel structure and find designs that fold, demonstrating the algorithm’s generalizability to novel structures. Overall, our results demonstrate that a deep learning model can match state-of-the-art energy functions for guiding protein design.SignificanceProtein design tasks typically depend on carefully modeled and parameterized heuristic energy functions. In this study, we propose a novel machine learning method for fixed-backbone protein sequence design, using a learned neural network potential to not only design the sequence of amino acids but also select their side-chain configurations, or rotamers. Factoring through a structural representation of the protein, the network generates designs on par with the state-of-the-art, despite having been entirely learned from data. These results indicate an exciting future for protein design driven by machine learning.

Download Full-text

Review on Various Machine Learning and Deep Learning Techniques for Prediction and Classification of Quotidian Datasets

Recent Advances in 3D Imaging, Modeling, and Reconstruction - Advances in Multimedia and Interactive Technologies ◽

10.4018/978-1-5225-5294-9.ch014 ◽

2020 ◽

pp. 296-323

Author(s):

Anisha M. Lal ◽

B. Koushik Reddy ◽

Aju D.

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Learning Algorithm ◽

Machine Learning Algorithms ◽

Training Data ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Supervised Methods ◽

Regression Techniques

Machine learning can be defined as the ability of a computer to learn and solve a problem without being explicitly coded. The efficiency of the program increases with experience through the task specified. In traditional programming, the program and the input are specified to get the output, but in the case of machine learning, the targets and predictors are provided to the algorithm make the process trained. This chapter focuses on various machine learning techniques and their performance with commonly used datasets. A supervised learning algorithm consists of a target variable that is to be predicted from a given set of predictors. Using these established targets is a function that plots targets to a given set of predictors. The training process allows the system to train the unknown data and continues until the model achieves a desired level of accuracy on the training data. The supervised methods can be usually categorized as classification and regression. This chapter discourses some of the popular supervised machine learning algorithms and their performances using quotidian datasets. This chapter also discusses some of the non-linear regression techniques and some insights on deep learning with respect to object recognition.

Download Full-text

The state-of-the-art on Intellectual Property Analytics (IPA): A literature review on artificial intelligence, machine learning and deep learning methods for analysing intellectual property (IP) data

World Patent Information ◽

10.1016/j.wpi.2018.07.002 ◽

2018 ◽

Vol 55 ◽

pp. 37-51 ◽

Cited By ~ 22

Author(s):

Leonidas Aristodemou ◽

Frank Tietze

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Deep Learning ◽

Intellectual Property ◽

Literature Review ◽

State Of The Art ◽

The State ◽

Learning Methods

Download Full-text

Bias Modeling for Distantly Supervised Relation Extraction

Mathematical Problems in Engineering ◽

10.1155/2015/969053 ◽

2015 ◽

Vol 2015 ◽

pp. 1-10 ◽

Cited By ~ 2

Author(s):

Yang Xiang ◽

Yaoyun Zhang ◽

Xiaolong Wang ◽

Yang Qin ◽

Wenying Han

Keyword(s):

Language Processing ◽

Learning Algorithm ◽

State Of The Art ◽

Relation Extraction ◽

Knowledge Bases ◽

Training Data ◽

Free Text ◽

Distant Supervision ◽

Annotation Process ◽

Noise Tolerant

Distant supervision (DS) automatically annotates free text with relation mentions from existing knowledge bases (KBs), providing a way to alleviate the problem of insufficient training data for relation extraction in natural language processing (NLP). However, the heuristic annotation process does not guarantee the correctness of the generated labels, promoting a hot research issue on how to efficiently make use of the noisy training data. In this paper, we model two types of biases to reduce noise: (1)bias-distto model the relative distance between points (instances) and classes (relation centers); (2)bias-rewardto model the possibility of each heuristically generated label being incorrect. Based on the biases, we propose three noise tolerant models:MIML-dist,MIML-dist-classify, andMIML-reward, building on top of a state-of-the-art distantly supervised learning algorithm. Experimental evaluations compared with three landmark methods on the KBP dataset validate the effectiveness of the proposed methods.

Download Full-text

Deep Learning Based-Phishing Attack Detection

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c6527.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 8428-8432

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Human Life ◽

Rapid Development ◽

Attack Detection ◽

Cyber Attacks ◽

Training Data ◽

Machine Learning Techniques ◽

Support Vector

Due to the rapid development of the communication technologies and global networking, lots of daily human life activities such as electronic banking, social networks, ecommerce, etc are transferred to the cyberspace. The anonymous, open and uncontrolled infrastructure of the internet enables an excellent platform for cyber attacks. Phishing is one of the cyber attacks in which attackers open some fraudulent websites similar to the popular and legal websites to steal the user’s sensitive information. Machine learning techniques such as J48, Support Vector Machine (SVM), Logistic Regression (LR), Naive Bayes (NB) and Artificial Neural Network (ANN) were widely to detect the phishing attacks. But, getting goodquality training data is one of the biggest problems in machine learning. So, a deep learning method called Deep Neural Network (DNN) is introduced to detect the phishing Uniform Resource Locators (URLs). Initially, a feature extractor is used to construct a 30-dimension feature vector based on URL-based features, HTML-based features and domain-based features. These features are given as input to the DNN classifier for phishing attack detection. It consists of one input layer, multiple hidden layers and one output layer. The multiple hidden layers in DNN try to learn high-level features in an incremental manner. Finally, the DNN returns a probability value which represent the phishing URLs and legitimate URLs. By using DNN the accuracy, precision and recall of phishing attack detection is improved.

Download Full-text

p1-FP: Extraction, Classification, and Prediction of Website Fingerprints with Deep Learning

Proceedings on Privacy Enhancing Technologies ◽

10.2478/popets-2019-0043 ◽

2019 ◽

Vol 2019 (3) ◽

pp. 191-209 ◽

Cited By ~ 1

Author(s):

Se Eun Oh ◽

Saikrishna Sunkam ◽

Nicholas Hopper

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Deep Neural Network ◽

State Of The Art ◽

Machine Learning Techniques ◽

Feature Engineering ◽

Engineering Process ◽

Learning Techniques ◽

Wide Range

Abstract Recent advances in Deep Neural Network (DNN) architectures have received a great deal of attention due to their ability to outperform state-of-the-art machine learning techniques across a wide range of application, as well as automating the feature engineering process. In this paper, we broadly study the applicability of deep learning to website fingerprinting. First, we show that unsupervised DNNs can generate lowdimensional informative features that improve the performance of state-of-the-art website fingerprinting attacks. Second, when used as classifiers, we show that they can exceed performance of existing attacks across a range of application scenarios, including fingerprinting Tor website traces, fingerprinting search engine queries over Tor, defeating fingerprinting defenses, and fingerprinting TLS-encrypted websites. Finally, we investigate which site-level features of a website influence its fingerprintability by DNNs.

Download Full-text

Emotional quantification of soundscapes by learning between samples

Multimedia Tools and Applications ◽

10.1007/s11042-020-09430-3 ◽

2020 ◽

Vol 79 (41-42) ◽

pp. 30387-30395

Author(s):

Stavros Ntalampiras

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Deep Learning ◽

State Of The Art ◽

Emotional Responses ◽

Machine Learning Techniques ◽

Specific Domain ◽

Learning Framework ◽

Learning Techniques ◽

Wide Range

Abstract Predicting the emotional responses of humans to soundscapes is a relatively recent field of research coming with a wide range of promising applications. This work presents the design of two convolutional neural networks, namely ArNet and ValNet, each one responsible for quantifying arousal and valence evoked by soundscapes. We build on the knowledge acquired from the application of traditional machine learning techniques on the specific domain, and design a suitable deep learning framework. Moreover, we propose the usage of artificially created mixed soundscapes, the distributions of which are located between the ones of the available samples, a process that increases the variance of the dataset leading to significantly better performance. The reported results outperform the state of the art on a soundscape dataset following Schafer’s standardized categorization considering both sound’s identity and the respective listening context.

Download Full-text