Tag-Enhanced Dynamic Compositional Neural Network over arbitrary tree structure for sentence representation

2021 ◽  
pp. 115182
Author(s):  
Chunlin Xu ◽  
Hui Wang ◽  
Shengli Wu ◽  
Zhiwei Lin
2003 ◽  
Vol 03 (01) ◽  
pp. 119-143 ◽  
Author(s):  
ZHIYONG WANG ◽  
ZHERU CHI ◽  
DAGAN FENG ◽  
AH CHUNG TSOI

Content-based image retrieval has become an essential technique in multimedia data management. However, due to the difficulties and complications involved in the various image processing tasks, a robust semantic representation of image content is still very difficult (if not impossible) to achieve. In this paper, we propose a novel content-based image retrieval approach with relevance feedback using adaptive processing of tree-structure image representation. In our approach, each image is first represented with a quad-tree, which is segmentation free. Then a neural network model with the Back-Propagation Through Structure (BPTS) learning algorithm is employed to learn the tree-structure representation of the image content. This approach that integrates image representation and similarity measure in a single framework is applied to the relevance feedback of the content-based image retrieval. In our approach, an initial ranking of the database images is first carried out based on the similarity between the query image and each of the database images according to global features. The user is then asked to categorize the top retrieved images into similar and dissimilar groups. Finally, the BPTS neural network model is used to learn the user's intention for a better retrieval result. This process continues until satisfactory retrieval results are achieved. In the refining process, a fine similarity grading scheme can also be adopted to improve the retrieval performance. Simulations on texture images and scenery pictures have demonstrated promising results which compare favorably with the other relevance feedback methods tested.


2014 ◽  
Vol 06 (01) ◽  
pp. 1450001 ◽  
Author(s):  
MIN-SUNG KOH ◽  
DANILO P. MANDIC ◽  
ANTHONY G. CONSTANTINIDES

Undecimated and decimated multivariate empirical mode decomposition filter banks (MEMDFBs) are introduced in order to incorporate MEMD equipped with downsampling into any arbitrary tree structure and provide flexibility in the choice of frequency bands. Undecimated MEMDFBs show the same results as those of original MEMD for an octave tree structure. Since the exact cut-off frequencies of MEMD are not known (i.e. due to data-driven decomposition), employing just simple downsampling in MEMD might cause aliasing. However, decimated MEMDFBs in this paper achieve perfect reconstruction with aliasing cancelled for any arbitrary tree. Applications of decimated/undecimated MEMDFBs for speech/audio and image signals are also included. Since decimated MEMDFBs can be applied into any arbitrary tree structure, this extends into MEMD packets. Arbitrary tree structures in decimated MEMDFBs also lead to more diverse choices in frequency bands for various multivariate applications requiring decimations.


Author(s):  
Zhou Cheng ◽  
Chun Yuan ◽  
Jiancheng Li ◽  
Haiqin Yang

Recursive neural network (RvNN) has been proved to be an effective and promising tool to learn sentence representations by explicitly exploiting the sentence structure. However, most existing work can only exploit simple tree structure, e.g., binary trees, or ignore the order of nodes, which yields suboptimal performance. In this paper, we proposed a novel neural network, namely TreeNet, to capture sentences structurally over the raw unconstrained constituency trees, where the number of child nodes can be arbitrary. In TreeNet, each node is learning from its left sibling and right child in a bottom-up left-to-right order, thus enabling the net to learn over any tree. Furthermore, multiple soft gates and a memory cell are employed in implementing the TreeNet to determine to what extent it should learn, remember and output, which proves to be a simple and efficient mechanism for semantic synthesis. Moreover, TreeNet significantly suppresses convolutional neural networks (CNN) and Long Short-Term Memory (LSTM) with fewer parameters. It improves the classification accuracy by 2%-5% with 42% of the best CNN’s parameters or 94% of standard LSTM’s. Extensive experiments demonstrate TreeNet achieves the state-of-the-art performance on all four typical text classification tasks.


Author(s):  
HANG-JOON KIM ◽  
SANG-KYOON KIM

This paper proposes an efficient method for on-line recognition of cursive Korean characters. Strokes, primitive components of Korean characters, are usually warped into a cursive form and classifying them is very difficult. To deal with such cursive strokes, we consider them as a recognition unit and automatically classify them by using an ART-2 neural network. The neural network has the advantage of assembling similar patterns together to form classes in a self-organized manner. This ART-2 stroke classifier contributes to high stroke recognition rate and less recognition time. A database for character recognition is also dynamically constructed with a tree structure, and a new character can be included simply by adding a new sequence to it. Character recognition is achieved by traversing the database with a sequence of recognized strokes and positional relations between the strokes. To verify the performance of the system, we tested it on 17,500 handwritten characters, and obtained a good recognition rate of 96.8% and a speed of 0.52 second per character. This results suggest that the proposed method is pertinent to be put into practical use.


2001 ◽  
Vol 46 (9) ◽  
pp. 744-746 ◽  
Author(s):  
Jiangtao Jia ◽  
Sheng Wu ◽  
Yawen Zhang ◽  
Chunsheng Liao ◽  
Chunhua Yan ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document