sequential semantics Latest Research Papers

To accelerate software development, developers frequently search and reuse existing code snippets from a large-scale codebase, e.g., GitHub. Over the years, researchers proposed many information retrieval (IR)-based models for code search, but they fail to connect the semantic gap between query and code. An early successful deep learning (DL)-based model DeepCS solved this issue by learning the relationship between pairs of code methods and corresponding natural language descriptions. Two major advantages of DeepCS are the capability of understanding irrelevant/noisy keywords and capturing sequential relationships between words in query and code. In this article, we proposed an IR-based model CodeMatcher that inherits the advantages of DeepCS (i.e., the capability of understanding the sequential semantics in important query words), while it can leverage the indexing technique in the IR-based model to accelerate the search response time substantially. CodeMatcher first collects metadata for query words to identify irrelevant/noisy ones, then iteratively performs fuzzy search with important query words on the codebase that is indexed by the Elasticsearch tool and finally reranks a set of returned candidate code according to how the tokens in the candidate code snippet sequentially matched the important words in a query. We verified its effectiveness on a large-scale codebase with ~41K repositories. Experimental results showed that CodeMatcher achieves an MRR (a widely used accuracy measure for code search) of 0.60, outperforming DeepCS, CodeHow, and UNIF by 82%, 62%, and 46%, respectively. Our proposed model is over 1.2K times faster than DeepCS. Moreover, CodeMatcher outperforms two existing online search engines (GitHub and Google search) by 46% and 33%, respectively, in terms of MRR. We also observed that: fusing the advantages of IR-based and DL-based models is promising; improving the quality of method naming helps code search, since method name plays an important role in connecting query and code.

Download Full-text

End-to-End Transition-Based Online Dialogue Disentanglement

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/535 ◽

2020 ◽

Author(s):

Hui Liu ◽

Zhan Shi ◽

Jia-Chen Gu ◽

Quan Liu ◽

Si Wei ◽

...

Keyword(s):

Large Scale ◽

Online Algorithm ◽

Research Field ◽

Semantic Coherence ◽

Large Scale Dataset ◽

End To End ◽

Sequential Information ◽

Almost All ◽

Classification And Clustering ◽

Sequential Semantics

Dialogue disentanglement aims to separate intermingled messages into detached sessions. The existing research focuses on two-step architectures, in which a model first retrieves the relationships between two messages and then divides the message stream into separate clusters. Almost all existing work puts significant efforts on selecting features for message-pair classification and clustering, while ignoring the semantic coherence within each session. In this paper, we introduce the first end-to- end transition-based model for online dialogue disentanglement. Our model captures the sequential information of each session as the online algorithm proceeds on processing a dialogue. The coherence in a session is hence modeled when messages are sequentially added into their best-matching sessions. Meanwhile, the research field still lacks data for studying end-to-end dialogue disentanglement, so we construct a large-scale dataset by extracting coherent dialogues from online movie scripts. We evaluate our model on both the dataset we developed and the publicly available Ubuntu IRC dataset [Kummerfeld et al., 2019]. The results show that our model significantly outperforms the existing algorithms. Further experiments demonstrate that our model better captures the sequential semantics and obtains more coherent disentangled sessions.

Download Full-text

Enhancing Unsupervised Requirements Traceability with Sequential Semantics

2019 26th Asia-Pacific Software Engineering Conference (APSEC) ◽

10.1109/apsec48747.2019.00013 ◽

2019 ◽

Author(s):

Lei Chen ◽

Dandan Wang ◽

Junjie Wang ◽

Qing Wang

Keyword(s):

Requirements Traceability ◽

Sequential Semantics

Download Full-text

The sequential semantics of producer effect systems

ACM SIGPLAN Notices ◽

10.1145/2480359.2429074 ◽

2013 ◽

Vol 48 (1) ◽

pp. 15-26 ◽

Cited By ~ 1

Author(s):

Ross Tate

Keyword(s):

Sequential Semantics

Download Full-text

The sequential semantics of producer effect systems

Proceedings of the 40th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages - POPL '13 ◽

10.1145/2429069.2429074 ◽

2013 ◽

Cited By ~ 16

Author(s):

Ross Tate

Keyword(s):

Sequential Semantics

Download Full-text

A Trellis Notion for Distributed System Diagnosis with Sequential Semantics

2006 8th International Workshop on Discrete Event Systems ◽

10.1109/wodes.2006.1678445 ◽

2006 ◽

Cited By ~ 2

Author(s):

E. Fabre

Keyword(s):

Distributed System ◽

System Diagnosis ◽

Sequential Semantics

Download Full-text

FUNCTIONAL PEARL Functional satisfaction

Journal of Functional Programming ◽

10.1017/s0956796804005155 ◽

2004 ◽

Vol 14 (6) ◽

pp. 647-656

Author(s):

LUC MARANGET

Keyword(s):

Programming Language ◽

Propositional Calculus ◽

Decision Procedures ◽

Predicate Calculus ◽

Machine Time ◽

Sequential Semantics

This work presents simple decision procedures for the propositional calculus and for a simple predicate calculus. These decision procedures are based upon enumeration of the possible values of the variables in an expression. Yet, by taking advantage of the sequential semantics of boolean connectors, not all values are enumerated. In some cases, dramatic savings of machine time can be achieved. In particular, an equivalence checker for a small programming language appears to be usable in practice.

Download Full-text