Tag-Enhanced Dynamic Compositional Neural Network over arbitrary tree structure for sentence representation

Content-based image retrieval has become an essential technique in multimedia data management. However, due to the difficulties and complications involved in the various image processing tasks, a robust semantic representation of image content is still very difficult (if not impossible) to achieve. In this paper, we propose a novel content-based image retrieval approach with relevance feedback using adaptive processing of tree-structure image representation. In our approach, each image is first represented with a quad-tree, which is segmentation free. Then a neural network model with the Back-Propagation Through Structure (BPTS) learning algorithm is employed to learn the tree-structure representation of the image content. This approach that integrates image representation and similarity measure in a single framework is applied to the relevance feedback of the content-based image retrieval. In our approach, an initial ranking of the database images is first carried out based on the similarity between the query image and each of the database images according to global features. The user is then asked to categorize the top retrieved images into similar and dissimilar groups. Finally, the BPTS neural network model is used to learn the user's intention for a better retrieval result. This process continues until satisfactory retrieval results are achieved. In the refining process, a fine similarity grading scheme can also be adopted to improve the retrieval performance. Simulations on texture images and scenery pictures have demonstrated promising results which compare favorably with the other relevance feedback methods tested.

Download Full-text

THEORY OF DIGITAL FILTER BANKS REALIZED VIA MULTIVARIATE EMPIRICAL MODE DECOMPOSITION

Advances in Adaptive Data Analysis ◽

10.1142/s1793536914500010 ◽

2014 ◽

Vol 06 (01) ◽

pp. 1450001 ◽

Cited By ~ 4

Author(s):

MIN-SUNG KOH ◽

DANILO P. MANDIC ◽

ANTHONY G. CONSTANTINIDES

Keyword(s):

Empirical Mode Decomposition ◽

Filter Banks ◽

Tree Structure ◽

Perfect Reconstruction ◽

Tree Structures ◽

Frequency Bands ◽

Multivariate Empirical Mode Decomposition ◽

Mode Decomposition ◽

Digital Filter Banks ◽

Arbitrary Tree

Undecimated and decimated multivariate empirical mode decomposition filter banks (MEMDFBs) are introduced in order to incorporate MEMD equipped with downsampling into any arbitrary tree structure and provide flexibility in the choice of frequency bands. Undecimated MEMDFBs show the same results as those of original MEMD for an octave tree structure. Since the exact cut-off frequencies of MEMD are not known (i.e. due to data-driven decomposition), employing just simple downsampling in MEMD might cause aliasing. However, decimated MEMDFBs in this paper achieve perfect reconstruction with aliasing cancelled for any arbitrary tree. Applications of decimated/undecimated MEMDFBs for speech/audio and image signals are also included. Since decimated MEMDFBs can be applied into any arbitrary tree structure, this extends into MEMD packets. Arbitrary tree structures in decimated MEMDFBs also lead to more diverse choices in frequency bands for various multivariate applications requiring decimations.

Download Full-text

TreeNet: Learning Sentence Representations with Unconstrained Tree Structure

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/557 ◽

2018 ◽

Cited By ~ 3

Author(s):

Zhou Cheng ◽

Chun Yuan ◽

Jiancheng Li ◽

Haiqin Yang

Keyword(s):

Neural Network ◽

Short Term Memory ◽

Memory Cell ◽

Tree Structure ◽

Sentence Structure ◽

Promising Tool ◽

Classification Tasks ◽

Long Short Term Memory ◽

Right Order ◽

Simple Tree

Recursive neural network (RvNN) has been proved to be an effective and promising tool to learn sentence representations by explicitly exploiting the sentence structure. However, most existing work can only exploit simple tree structure, e.g., binary trees, or ignore the order of nodes, which yields suboptimal performance. In this paper, we proposed a novel neural network, namely TreeNet, to capture sentences structurally over the raw unconstrained constituency trees, where the number of child nodes can be arbitrary. In TreeNet, each node is learning from its left sibling and right child in a bottom-up left-to-right order, thus enabling the net to learn over any tree. Furthermore, multiple soft gates and a memory cell are employed in implementing the TreeNet to determine to what extent it should learn, remember and output, which proves to be a simple and efficient mechanism for semantic synthesis. Moreover, TreeNet significantly suppresses convolutional neural networks (CNN) and Long Short-Term Memory (LSTM) with fewer parameters. It improves the classification accuracy by 2%-5% with 42% of the best CNN’s parameters or 94% of standard LSTM’s. Extensive experiments demonstrate TreeNet achieves the state-of-the-art performance on all four typical text classification tasks.

Download Full-text

A Neural Network Model to Include Textual Dependency Tree Structure in Gender Classification of Russian Text Author

Advanced Technologies in Robotics and Intelligent Systems - Mechanisms and Machine Science ◽

10.1007/978-3-030-33491-8_48 ◽

2020 ◽

pp. 405-412

Author(s):

A. Sboev ◽

A. Selivanov ◽

R. Rybka ◽

I. Moloshnikov ◽

D. Bogachev

Keyword(s):

Neural Network ◽

Network Model ◽

Neural Network Model ◽

Tree Structure ◽

Russian Text ◽

Gender Classification ◽

Dependency Tree

Download Full-text

ON-LINE RECOGNITION OF CURSIVE KOREAN CHARACTERS USING ART-BASED STROKE CLASSIFICATION (RECOGNITION OF CURSIVE KOREAN CHARACTERS)

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001496000463 ◽

1996 ◽

Vol 10 (07) ◽

pp. 791-812 ◽

Cited By ~ 3

Author(s):

HANG-JOON KIM ◽

SANG-KYOON KIM

Keyword(s):

Neural Network ◽

Efficient Method ◽

Character Recognition ◽

Recognition Rate ◽

Tree Structure ◽

Recognition Time ◽

Stroke Classification ◽

Self Organized ◽

The Neural Network ◽

On Line

This paper proposes an efficient method for on-line recognition of cursive Korean characters. Strokes, primitive components of Korean characters, are usually warped into a cursive form and classifying them is very difficult. To deal with such cursive strokes, we consider them as a recognition unit and automatically classify them by using an ART-2 neural network. The neural network has the advantage of assembling similar patterns together to form classes in a self-organized manner. This ART-2 stroke classifier contributes to high stroke recognition rate and less recognition time. A database for character recognition is also dynamically constructed with a tree structure, and a new character can be included simply by adding a new sequence to it. Character recognition is achieved by traversing the database with a sequence of recognized strokes and positional relations between the strokes. To verify the performance of the system, we tested it on 17,500 handwritten characters, and obtained a good recognition rate of 96.8% and a speed of 0.52 second per character. This results suggest that the proposed method is pertinent to be put into practical use.

Download Full-text