Enhancing Object Distinction Utilizing Probabilistic Topic Model

Website fingerprinting (WFP) attack enables identifying the websites a user is browsing even under the protection of privacy-enhancing technologies (PETs). Previous studies demonstrate that most machine-learning attacks need multiple types of features as input, thus inducing tremendous feature engineering work. However, we show the other alternative. That is, we present Probabilistic Fingerprinting (PF), a new website fingerprinting attack that merely leverages one type of features. They are produced by using a mathematical model PWFP that combines a probabilistic topic model with WFP for the first time, due to a finding that a plain text and the sequence file generated from a traffic instance are essentially the same. Experimental results show that the proposed new features are more distinguishing than the existing features. In a closed-world setting, PF attains a better accuracy performance (99.79% at most) than prior attacks on various datasets gathered in the scenarios of Shadowsocks, SSH, and TLS, respectively. Besides, even when the number of training instances drops to as few as 4, PF still reaches an accuracy of above 90%. In the more realistic open-world setting, PF attains a high true positive rate (TPR) and Bayes detection rate (BDR), and a low false positive rate (FPR) in all evaluations, which outperforms the other attacks. These results highlight that it is meaningful and possible to explore new features to improve the accuracy of WFP attacks.

Download Full-text

A probabilistic topic model using deep visual word representation for simultaneous image classification and annotation

Journal of Visual Communication and Image Representation ◽

10.1016/j.jvcir.2019.01.009 ◽

2019 ◽

Vol 59 ◽

pp. 195-203 ◽

Cited By ~ 3

Author(s):

Seyed Navid Mohammadi Foumani ◽

Ahmad Nickabadi

Keyword(s):

Image Classification ◽

Topic Model ◽

Visual Word ◽

Probabilistic Topic Model ◽

Word Representation

Download Full-text

A user-oriented semi-supervised probabilistic topic model

2016 2nd IEEE International Conference on Computer and Communications (ICCC) ◽

10.1109/compcomm.2016.7924706 ◽

2016 ◽

Author(s):

Jing Li ◽

Yongbin Qin ◽

Ruizhang Huang

Keyword(s):

Topic Model ◽

Probabilistic Topic Model

Download Full-text

A probabilistic topic model for clinical risk stratification from electronic health records

Journal of Biomedical Informatics ◽

10.1016/j.jbi.2015.09.005 ◽

2015 ◽

Vol 58 ◽

pp. 28-36 ◽

Cited By ~ 27

Author(s):

Zhengxing Huang ◽

Wei Dong ◽

Huilong Duan

Keyword(s):

Electronic Health Records ◽

Risk Stratification ◽

Topic Model ◽

Health Records ◽

Clinical Risk ◽

Probabilistic Topic Model ◽

Electronic Health

Download Full-text

A scalable automatic service discovery approach based on probabilistic topic model

International Journal of Web and Grid Services ◽

10.1504/ijwgs.2016.10001002 ◽

2016 ◽

Vol 12 (4) ◽

pp. 349

Author(s):

Yuan Yuan ◽

Xiuguo Zhang ◽

Weishi Zhang

Keyword(s):

Service Discovery ◽

Topic Model ◽

Probabilistic Topic Model

Download Full-text

Inferring functional miRNA–mRNA regulatory modules in epithelial–mesenchymal transition with a probabilistic topic model

Computers in Biology and Medicine ◽

10.1016/j.compbiomed.2011.12.011 ◽

2012 ◽

Vol 42 (4) ◽

pp. 428-437 ◽

Cited By ~ 5

Author(s):

Junpeng Zhang ◽

Bing Liu ◽

Jianfeng He ◽

Lei Ma ◽

Jiuyong Li

Keyword(s):

Topic Model ◽

Epithelial Mesenchymal Transition ◽

Mesenchymal Transition ◽

Regulatory Modules ◽

Probabilistic Topic Model

Download Full-text

Real-time traffic incident detection using a probabilistic topic model

Information Systems ◽

10.1016/j.is.2015.07.002 ◽

2015 ◽

Vol 54 ◽

pp. 169-188 ◽

Cited By ~ 23

Author(s):

Akira Kinoshita ◽

Atsuhiro Takasu ◽

Jun Adachi

Keyword(s):

Real Time ◽

Topic Model ◽

Incident Detection ◽

Traffic Incident ◽

Real Time Traffic ◽

Probabilistic Topic Model

Download Full-text

SCENE CLASSFICATION BASED ON THE SEMANTIC-FEATURE FUSION FULLY SPARSE TOPIC MODEL FOR HIGH SPATIAL RESOLUTION REMOTE SENSING IMAGERY

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xli-b7-451-2016 ◽

2016 ◽

Vol XLI-B7 ◽

pp. 451-457

Author(s):

Qiqi Zhu ◽

Yanfei Zhong ◽

Liangpei Zhang

Keyword(s):

Spatial Resolution ◽

Spatial Information ◽

Topic Model ◽

High Spatial Resolution ◽

Feature Fusion ◽

Spectral Feature ◽

Semantic Feature ◽

Scene Classification ◽

Probabilistic Topic Model ◽

Sparse Topic Model

Topic modeling has been an increasingly mature method to bridge the semantic gap between the low-level features and high-level semantic information. However, with more and more high spatial resolution (HSR) images to deal with, conventional probabilistic topic model (PTM) usually presents the images with a dense semantic representation. This consumes more time and requires more storage space. In addition, due to the complex spectral and spatial information, a combination of multiple complementary features is proved to be an effective strategy to improve the performance for HSR image scene classification. But it should be noticed that how the distinct features are fused to fully describe the challenging HSR images, which is a critical factor for scene classification. In this paper, a semantic-feature fusion fully sparse topic model (SFF-FSTM) is proposed for HSR imagery scene classification. In SFF-FSTM, three heterogeneous features – the mean and standard deviation based spectral feature, wavelet based texture feature, and dense scale-invariant feature transform (SIFT) based structural feature are effectively fused at the latent semantic level. The combination of multiple semantic-feature fusion strategy and sparse based FSTM is able to provide adequate feature representations, and can achieve comparable performance with limited training samples. Experimental results on the UC Merced dataset and Google dataset of SIRI-WHU demonstrate that the proposed method can improve the performance of scene classification compared with other scene classification methods for HSR imagery.

Download Full-text

A Probabilistic Topic Model based on Local Word Relationships in Overlapped Windows

Signal and Data Processing ◽

10.29252/jsdp.15.4.57 ◽

2019 ◽

Vol 15 (4) ◽

pp. 57-70

Author(s):

Marziea Rahimi ◽

Morteza Zahedi ◽

Hoda Mashayekhi ◽

◽

...

Keyword(s):

Topic Model ◽

Model Based ◽

Probabilistic Topic Model

Download Full-text

A ML and NLP based Framework for Sentiment Analysis on Bigdata

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijitee.d9062.029420 ◽

2020 ◽

Vol 8 (5) ◽

pp. 189-200

Keyword(s):

Social Networks ◽

Sentiment Analysis ◽

Language Processing ◽

Online Social Networks ◽

Topic Model ◽

Social Feedback ◽

Feedback Systems ◽

Multiple Sources ◽

Probabilistic Topic Model ◽

Novel Approach

Big data as multiple sources and social media is one of them. Such data is rich in opinion of people and needs automated approach with Natural Language Processing (NLP) and Machine Learning (ML) to obtain and summarize social feedback. With ML as an integral part of Artificial Intelligence (AI), machines can demonstrate intelligence exhibited by humans. ML is widely used in different domains. With proliferation of Online Social Networks (OSNs), people of all walks of life exchange their views instantly. Thus they became platforms where opinions or people are available. In other words, social feedback on products and services are available. For instance, Twitter produces large volumes of such data which is of much use to enterprises to garner Business Intelligence (BI) useful to make expert decisions. In addition to the traditional feedback systems, the feedback (opinions) over social networks provide depth in the intelligence to revise strategies and policies. Sentiment analysis is the phenomenon which is employed to analyze opinions and classify them into positive, negative and neutral. Existing studies usually treated overall sentiment analysis and aspect-based sentiment analysis in isolation, and then introduce a variety of methods to analyse either overall sentiments or aspect-level sentiments, but not both. Usage of probabilistic topic model is a novel approach in sentiment analysis. In this paper, we proposed a framework for comprehensive analysis of overall and aspect-based sentiments. The framework is realized with aspect based topic modelling for sentiment analysis and ensemble learning algorithms. It also employs many ML algorithms with supervised learning approach. Benchmark datasets used in international SemEval conferences are used for empirical study. Experimental results revealed the efficiency of the proposed framework over the state of the art.

Download Full-text