Plagiarism Detection in Computer Programming Using Feature Extraction From Ultra-Fine-Grained Repositories

The existing seal ring surface defect detection methods for aerospace applications have the problems of low detection efficiency, strong specificity, large fine-grained classification errors, and unstable detection results. Considering these problems, a fine-grained seal ring surface defect detection algorithm for aerospace applications is proposed. Based on analysis of the stacking process of standard convolution, heat maps of original pixels in the receptive field participating in the convolution operation are quantified and generated. According to the generated heat map, the feature extraction optimization method of convolution combinations with different dilation rates is proposed, and an efficient convolution feature extraction network containing three kinds of dilated convolutions is designed. Combined with the O-ring surface defect features, a multiscale defect detection network is designed. Before the head of multiscale classification and position regression, feature fusion tree modules are added to ensure the reuse and compression of the responsive features of different receptive fields on the same scale feature maps. Experimental results show that on the O-rings-3000 testing dataset, the mean condition accuracy of the proposed algorithm reaches 95.10% for 5 types of surface defects of aerospace O-rings. Compared with RefineDet, the mean condition accuracy of the proposed algorithm is only reduced by 1.79%, while the parameters and FLOPs are reduced by 35.29% and 64.90%, respectively. Moreover, the proposed algorithm has good adaptability to image blur and light changes caused by the cutting of imaging hardware, thus saving the cost.

Download Full-text

PPCNN: Object Detection using Fine-grained Feature Extraction and Localization

The Journal of Korean Institute of Information Technology ◽

10.14801/jkiit.2021.19.2.29 ◽

2021 ◽

Vol 19 (2) ◽

pp. 29-37

Author(s):

Md Foysal Haque ◽

Dae-Seong Kang

Keyword(s):

Feature Extraction ◽

Object Detection ◽

Fine Grained

Download Full-text

Automated Plagiarism Detection for Computer Programming Exercises Based on Patterns of Resubmission

Proceedings of the 2018 ACM Conference on International Computing Education Research - ICER '18 ◽

10.1145/3230977.3231006 ◽

2018 ◽

Cited By ~ 5

Author(s):

Narjes Tahaei ◽

David C. Noelle

Keyword(s):

Computer Programming ◽

Plagiarism Detection

Download Full-text

Flow Chart Generation-Based Source Code Similarity Detection Using Process Mining

Scientific Programming ◽

10.1155/2020/8865413 ◽

2020 ◽

Vol 2020 ◽

pp. 1-15

Author(s):

Feng Zhang ◽

Lulu Li ◽

Cong Liu ◽

Qingtian Zeng

Keyword(s):

Computer Programming ◽

Process Mining ◽

Source Code ◽

Plagiarism Detection ◽

Dynamic Features ◽

Code Obfuscation ◽

Loop Unrolling ◽

Similarity Detection ◽

Flow Charts ◽

And Function

Source code similarity detection has extensive applications in computer programming teaching and software intellectual property protection. In the teaching of computer programming courses, students may utilize some complex source code obfuscation techniques, e.g., opaque predicates, loop unrolling, and function inlining and outlining, to reduce the similarity between code fragments and avoid the plagiarism detection. Existing source code similarity detection approaches only consider static features of source code, making it difficult to cope with more complex code obfuscation techniques. In this paper, we propose a novel source code similarity detection approach by considering the dynamic features at runtime of source code using process mining. More specifically, given two pieces of source code, their running logs are obtained by source code instrumentation and execution. Next, process mining is used to obtain the flow charts of the two pieces of source code by analyzing their collected running logs. Finally, similarity of the two pieces of source code is measured by computing the similarity of these two flow charts. Experimental results show that the proposed approach can deal with more complex obfuscation techniques including opaque predicates and loop unrolling as well as function inlining and outlining, which cannot be handled by existing work properly. Therefore, we argue that our approach can defeat commonly used code obfuscation techniques more effectively for source code similarity detection than the existing state-of-the-art approaches.

Download Full-text

DGANet: A Dilated Graph Attention-Based Network for Local Feature Extraction on 3D Point Clouds

Remote Sensing ◽

10.3390/rs13173484 ◽

2021 ◽

Vol 13 (17) ◽

pp. 3484

Author(s):

Jie Wan ◽

Zhong Xie ◽

Yongyang Xu ◽

Ziyin Zeng ◽

Ding Yuan ◽

...

Keyword(s):

Feature Extraction ◽

Long Range ◽

Point Clouds ◽

Feature Representation ◽

Learning Ability ◽

Geometric Feature ◽

Cloud Data ◽

Fine Grained ◽

3D Point Clouds ◽

Benchmark Datasets

Feature extraction on point clouds is an essential task when analyzing and processing point clouds of 3D scenes. However, there still remains a challenge to adequately exploit local fine-grained features on point cloud data due to its irregular and unordered structure in a 3D space. To alleviate this problem, a Dilated Graph Attention-based Network (DGANet) with a certain feature for learning ability is proposed. Specifically, we first build a local dilated graph-like region for each input point to establish the long-range spatial correlation towards its corresponding neighbors, which allows the proposed network to access a wider range of geometric information of local points with their long-range dependencies. Moreover, by integrating the dilated graph attention module (DGAM) implemented by a novel offset–attention mechanism, the proposed network promises to highlight the differing ability of each edge of the constructed local graph to uniquely learn the discrepancy feature of geometric attributes between the connected point pairs. Finally, all the learned edge attention features are further aggregated, allowing the most significant geometric feature representation of local regions by the graph–attention pooling to fully extract local detailed features for each point. The validation experiments using two challenging benchmark datasets demonstrate the effectiveness and powerful generation ability of our proposed DGANet in both 3D object classification and segmentation tasks.

Download Full-text

Using a Dictionary and n-gram Alignment to Improve Fine-grained Cross-Language Plagiarism Detection

Proceedings of the 2016 ACM Symposium on Document Engineering - DocEng '16 ◽

10.1145/2960811.2960817 ◽

2016 ◽

Cited By ~ 5

Author(s):

Nava Ehsan ◽

Frank Wm. Tompa ◽

Azadeh Shakery

Keyword(s):

Plagiarism Detection ◽

Fine Grained ◽

N Gram ◽

Cross Language

Download Full-text

C-GCN: A Flexible CSI Phase Feature Extraction Network for Error Suppression in Indoor Positioning

Entropy ◽

10.3390/e23081004 ◽

2021 ◽

Vol 23 (8) ◽

pp. 1004

Author(s):

Wen Liu ◽

Qianqian Cheng ◽

Zhongliang Deng ◽

Mingjie Jia

Keyword(s):

Feature Extraction ◽

Signal Propagation ◽

Indoor Positioning ◽

Superior Performance ◽

Indoor Environments ◽

Positioning Error ◽

Data Packets ◽

Fine Grained ◽

The Relationship ◽

Error Suppression

Channel state information (CSI) provides a fine-grained description of the signal propagation process, which has attracted extensive attention in the field of indoor positioning. However, considering the influence of environment and hardware, the phase of CSI is distorted in most cases. It is difficult to extract effective location features in multiple scenes only through the determined artificial experience model. Graph neural network has performed well in many fields in recent years, but there is still a lot of room to explore in the field of indoor positioning. In this paper, a phase feature extraction network based on multi-dimensional correlation is proposed, named Cooperation-Graph Convolution Network (C-GCN). The purpose of C-GCN is to extract new features of multiple correlation and to mine the relationship between antenna and subcarrier as much as possible. C-GCN is composed of convolution layer and graph convolution layer. In the graph convolution layer, C-GCN regards each subcarrier of each antenna as a node in the graph network, constructs the connection by the correlation between the antenna and the subcarrier, and aggregates the node vectors by graph convolution. In the convolution layer, there is a natural corresponding structure between data packets, C-GCN extracts the fluctuation with convolution in Euclidean space. C-GCN combines these two layers, and applies end-to-end supervised training to obtain effective features. Extensive experiments are conducted in typical indoor environments to verify the superior performance of C-GCN in restraining error tailing. The average positioning error of C-GCN is 1.29 m in comprehensive office and 1.71 m in garage. Combined with the amplitude feature, the average positioning error is 0.99 m in comprehensive office and 1.14 m in garage.

Download Full-text

Birds Fine-grained Feature Extraction Based on Transfer Learning

10.1109/icsai53574.2021.9664058 ◽

2021 ◽

Author(s):

Peng Wu ◽

Gang Wu

Keyword(s):

Feature Extraction ◽

Transfer Learning ◽

Fine Grained

Download Full-text

Efficient Two-Step Middle-Level Part Feature Extraction for Fine-Grained Visual Categorization

IEICE Transactions on Information and Systems ◽

10.1587/transinf.2015edp7358 ◽

2016 ◽

Vol E99.D (6) ◽

pp. 1626-1634

Author(s):

Hideki NAKAYAMA ◽

Tomoya TSUDA

Keyword(s):

Feature Extraction ◽

Middle Level ◽

Visual Categorization ◽

Fine Grained ◽

Part Feature

Download Full-text

Flowchart-Based Cross-Language Source Code Similarity Detection

Scientific Programming ◽

10.1155/2020/8835310 ◽

2020 ◽

Vol 2020 ◽

pp. 1-15

Author(s):

Feng Zhang ◽

Guofan Li ◽

Cong Liu ◽

Qian Song

Keyword(s):

Programming Languages ◽

Programming Language ◽

Computer Programming ◽

Source Code ◽

Common Source ◽

Intermediate Structure ◽

Plagiarism Detection ◽

Code Obfuscation ◽

Similarity Detection ◽

Cross Language

Source code similarity detection has various applications in code plagiarism detection and software intellectual property protection. In computer programming teaching, students may convert the source code written in one programming language into another language for their code assignment submission. Existing similarity measures of source code written in the same language are not applicable for the cross-language code similarity detection because of syntactic differences among different programming languages. Meanwhile, existing cross-language source similarity detection approaches are susceptible to complex code obfuscation techniques, such as replacing equivalent control structure and adding redundant statements. To solve this problem, we propose a cross-language code similarity detection (CLCSD) approach based on code flowcharts. In general, two source code fragments written in different programming languages are transformed into standardized code flowcharts (SCFC), and their similarity is obtained by measuring their corresponding SCFC. More specifically, we first introduce the standardized code flowchart (SCFC) model to be the uniform flowcharts representation of source code written in different languages. SCFC is language-independent, and therefore, it can be used as the intermediate structure for source code similarity detection. Meanwhile, transformation techniques are given to transform source code written in a specific programming language into an SCFC. Second, we propose the SCFC-SPGK algorithm based on the shortest path graph kernel to measure the similarity between two SCFCs. Thus, the similarity between two pieces of source code in different programming languages is given by the similarity between SCFCs. Experimental results show that compared with existing approaches, CLCSD has higher accuracy in cross-language source code similarity detection. Furthermore, CLCSD cannot only handle common source code obfuscation techniques used by students in computer programming teaching but also obtain nearly 90% accuracy in dealing with some complex obfuscation techniques.

Download Full-text