scholarly journals A BERT-Based Two-Stage Model for Chinese Chengyu Recommendation

Author(s):  
Minghuan Tan ◽  
Jing Jiang ◽  
Bing Tian Dai

In Chinese, Chengyu are fixed phrases consisting of four characters. As a type of idioms, their meanings usually cannot be derived from their component characters. In this article, we study the task of recommending a Chengyu given a textual context. Observing some of the limitations with existing work, we propose a two-stage model, where during the first stage we re-train a Chinese BERT model by masking out Chengyu from a large Chinese corpus with a wide coverage of Chengyu. During the second stage, we fine-tune the re-trained, Chengyu-oriented BERT on a specific Chengyu recommendation dataset. We evaluate this method on ChID and CCT datasets and find that it can achieve the state of the art on both datasets. Ablation studies show that both stages of training are critical for the performance gain.

2020 ◽  
Vol 34 (05) ◽  
pp. 8600-8607
Author(s):  
Haiyun Peng ◽  
Lu Xu ◽  
Lidong Bing ◽  
Fei Huang ◽  
Wei Lu ◽  
...  

Target-based sentiment analysis or aspect-based sentiment analysis (ABSA) refers to addressing various sentiment analysis tasks at a fine-grained level, which includes but is not limited to aspect extraction, aspect sentiment classification, and opinion extraction. There exist many solvers of the above individual subtasks or a combination of two subtasks, and they can work together to tell a complete story, i.e. the discussed aspect, the sentiment on it, and the cause of the sentiment. However, no previous ABSA research tried to provide a complete solution in one shot. In this paper, we introduce a new subtask under ABSA, named aspect sentiment triplet extraction (ASTE). Particularly, a solver of this task needs to extract triplets (What, How, Why) from the inputs, which show WHAT the targeted aspects are, HOW their sentiment polarities are and WHY they have such polarities (i.e. opinion reasons). For instance, one triplet from “Waiters are very friendly and the pasta is simply average” could be (‘Waiters’, positive, ‘friendly’). We propose a two-stage framework to address this task. The first stage predicts what, how and why in a unified model, and then the second stage pairs up the predicted what (how) and why from the first stage to output triplets. In the experiments, our framework has set a benchmark performance in this novel triplet extraction task. Meanwhile, it outperforms a few strong baselines adapted from state-of-the-art related methods.


Sensors ◽  
2018 ◽  
Vol 18 (12) ◽  
pp. 4308 ◽  
Author(s):  
Xiang Zhang ◽  
Wei Yang ◽  
Xiaolin Tang ◽  
Jie Liu

To improve the accuracy of lane detection in complex scenarios, an adaptive lane feature learning algorithm which can automatically learn the features of a lane in various scenarios is proposed. First, a two-stage learning network based on the YOLO v3 (You Only Look Once, v3) is constructed. The structural parameters of the YOLO v3 algorithm are modified to make it more suitable for lane detection. To improve the training efficiency, a method for automatic generation of the lane label images in a simple scenario, which provides label data for the training of the first-stage network, is proposed. Then, an adaptive edge detection algorithm based on the Canny operator is used to relocate the lane detected by the first-stage model. Furthermore, the unrecognized lanes are shielded to avoid interference in subsequent model training. Then, the images processed by the above method are used as label data for the training of the second-stage model. The experiment was carried out on the KITTI and Caltech datasets, and the results showed that the accuracy and speed of the second-stage model reached a high level.


2008 ◽  
Vol 19 (1) ◽  
pp. 85-104 ◽  
Author(s):  
IRA M BERNSTEIN ◽  
MARILYN J CIPOLLA

Current hypotheses regarding the origins of preeclampsia have focused on the “Two stage model”. This model suggests that the primary steps in the pathophysiologic sequence of preeclampsia are initiated by abnormal placentation including the classic finding of abnormal trophoblast invasion of maternal decidual spiral arteries. The second stage of the sequence includes the elaboration of a single or multiple substances from these disordered placentas which contribute to the generalized maternal systemic illness, eventually manifesting as endothelial injury, hypertension and proteinuria. Recent studies have focused on the role of pro and anti-angiogenic peptides as potential placentally derived aetiologic agents in this pathophysiologic sequence, although other placental products have been highlighted in recent research. Despite the fact that this modeling of preeclampsia has widespread support significant limitations to this hypothesis can be identified.


1968 ◽  
Vol 66 (2) ◽  
pp. 273-280 ◽  
Author(s):  
G. G. Meynell ◽  
Joan Maw

SUMMARYColony counts on mice given the same number ofSalmonellaalways differ considerably. However, the standard error of the mean log count does not increase after the first 1·5 hr. of infection until the 8th or 10th day. These infections therefore appear to pass through an initial stage lasting a few hours, in which a varying proportion of the inoculum is killed, followed by a prolonged second stage in which the scatter in individual colony counts remains constant.


2003 ◽  
Vol 17 (2) ◽  
pp. 196-219 ◽  
Author(s):  
Patricia S. Miller ◽  
Gretchen A. Kerr

This study examined the role experimentation of university student athletes using in-depth interviews. The results revealed participants’ role experimentation was limited to three spheres: athletic, academic, and social. Participants’ exploration of and commitment to roles revealed a two-stage model of identity formation. The first stage, Over-Identification with the Athlete Role, revealed a singular focus on athletics that persisted throughout much of the participants’ university careers. The second stage, Deferred Role Experimentation, reflected an increased investment in academic and social roles in the participants’ upper years. Results were consistent with previous findings of an athletic identity among intercollegiate student-athletes (Brewer, Van Raalte, & Linder, 1993), but supported Perna, Zaichkowsky, and Bocknek’s (1996) suggestion that identity foreclosure may have been overgeneralized.


Perception ◽  
1978 ◽  
Vol 7 (4) ◽  
pp. 371-383 ◽  
Author(s):  
J Timothy Petersik ◽  
Kenyon I Hicks ◽  
Allan J Pantle

In the present studies a pair of random-dot frames was constructed so that two areas in the first frame (f1) were correlated with two areas in the second frame (f2). The alternation of the pair of frames (an f1-f2 sequence) gave rise to two subjective figures. When two pairs of random-dot frames (an f1-f2 sequence and an f3-f4 sequence), each of which produced two subjective figures in different locations, were themselves alternated, the subjective figures from the f1-f2 sequence interacted with the subjective figures from the f3-f4 sequence to produce apparent movement. With any one of the four general kinds of displays which we constructed, subjects usually perceived only one of two types of subjective-figure movement. The type of movement that was perceived with a given display depended primarily upon the degree of change (across the interval between an f1-f2 and an f3-f4 sequence) of the internal structure of the successively generated subjective figures. Relative intensity differences between the subjective figures and their backgrounds influenced the type of apparent movement seen, whereas variations in the density of elements in a display did not. We tentatively propose a two-stage model to explain the apparent movement of the subjective figures: the first stage is assumed to generate the subjective figures by means of a cross-correlation of the intensity distributions of the two frames within an f1-f2 sequence and within an f3-f4 sequence; on the basis of inputs from the first stage, the second stage generates apparent movement signals for the subjective figures.


Author(s):  
Hui Ying ◽  
Zhaojin Huang ◽  
Shu Liu ◽  
Tianjia Shao ◽  
Kun Zhou

Current instance segmentation methods can be categorized into segmentation-based methods and proposal-based methods. The former performs segmentation first and then does clustering, while the latter detects objects first and then predicts the mask for each object proposal. In this work, we propose a single-stage method, named EmbedMask, that unifies both methods by taking their advantages, so it can achieve good performance in instance segmentation and produce high-resolution masks in a high speed. EmbedMask introduces two newly defined embeddings for mask prediction, which are pixel embedding and proposal embedding. During training, we enforce the pixel embedding to be close to its coupled proposal embedding if they belong to the same instance. During inference, pixels are assigned to the mask of the proposal if their embeddings are similar. This mechanism brings several benefits. First, the pixel-level clustering enables EmbedMask to generate high-resolution masks and avoids the complicated two-stage mask prediction. Second, the existence of proposal embedding simplifies and strengthens the clustering procedure, so our method can achieve high speed and better performance than segmentation-based methods. Without any bell or whistle, EmbedMask outperforms the state-of-the-art instance segmentation method Mask R-CNN on the challenging COCO dataset, obtaining more detailed masks at a higher speed.


2018 ◽  
Vol 19 (2) ◽  
pp. 393-408 ◽  
Author(s):  
Yumeng Tao ◽  
Kuolin Hsu ◽  
Alexander Ihler ◽  
Xiaogang Gao ◽  
Soroosh Sorooshian

Abstract Compared to ground precipitation measurements, satellite-based precipitation estimation products have the advantage of global coverage and high spatiotemporal resolutions. However, the accuracy of satellite-based precipitation products is still insufficient to serve many weather, climate, and hydrologic applications at high resolutions. In this paper, the authors develop a state-of-the-art deep learning framework for precipitation estimation using bispectral satellite information, infrared (IR), and water vapor (WV) channels. Specifically, a two-stage framework for precipitation estimation from bispectral information is designed, consisting of an initial rain/no-rain (R/NR) binary classification, followed by a second stage estimating the nonzero precipitation amount. In the first stage, the model aims to eliminate the large fraction of NR pixels and to delineate precipitation regions precisely. In the second stage, the model aims to estimate the pointwise precipitation amount accurately while preserving its heavily skewed distribution. Stacked denoising autoencoders (SDAEs), a commonly used deep learning method, are applied in both stages. Performance is evaluated along a number of common performance measures, including both R/NR and real-valued precipitation accuracy, and compared with an operational product, Precipitation Estimation from Remotely Sensed Information Using Artificial Neural Networks–Cloud Classification System (PERSIANN-CCS). For R/NR binary classification, the proposed two-stage model outperforms PERSIANN-CCS by 32.56% in the critical success index (CSI). For real-valued precipitation estimation, the two-stage model is 23.40% lower in average bias, is 44.52% lower in average mean squared error, and has a 27.21% higher correlation coefficient. Hence, the two-stage deep learning framework has the potential to serve as a more accurate and more reliable satellite-based precipitation estimation product. The authors also provide some future directions for development of satellite-based precipitation estimation products in both incorporating auxiliary information and improving retrieval algorithms.


Author(s):  
Ankit Bhatnagar ◽  
S. Pushpavanam

In this work a one dimensional steady state model is developed for a single stage and two-stage bottom fed entrained flow coal gasifier for. The single stage model was first analysed for two different oxidants (i) oxygen and (ii) air to study their effects on gasification. Analysis proved oxygen to be the better oxidant. The model is then extended to a two stage gasifier. Here 30 (70) percent of the coal is fed in the first stage (second stage). The first (second) stage operates in oxidant rich (lean) environment. The performances of single stage and two-stage models are compared in terms of their cold gas efficiencies for the same coal feed rates. It is observed that a two-stage system has better cold gas efficiency (77%) than a single stage system (72%) with lower oxygen consumption. The two-stage model is used to optimise the O2/Coal ratio as the H2O/Coal ratio is varied in the 2nd stage. The optimum yields the highest cold gas efficiency with minimum possible oxygen consumption.


Frequenz ◽  
2014 ◽  
Vol 68 (9-10) ◽  
Author(s):  
Erhan Ersoy ◽  
Serguei Chevtchenko ◽  
Paul Kurpas ◽  
Wolfgang Heinrich

AbstractWhile the vast majority of GaN X-band PAs is realized as microstrip circuits, this paper reports design, fabrication and measurement of a coplanar version. The amplifier is processed using the FBH 4-inch GaN-on-SiC technology with 0.25 µm-gate GaN HEMTs. The two-stage power amplifier circuit delivers more than 12 W cw output power at 10 GHz, with a large-signal gain of 20 dB and a final stage drain efficiency of 45%. Benchmarking shows that these are best-in-class values for a coplanar X-band MMIC, which come very close to the state-of-the-art microstrip counterparts.


Sign in / Sign up

Export Citation Format

Share Document