A shallow parser based on closed-class words to capture relations in biomedical text

Gondy Leroy; Hsinchun Chen; Jesse D Martinez

doi:10.1016/s1532-0464(03)00039-x

Adjuncts as a diagnostic of polysynthetic word formation in Inuit

10.1093/oso/9780198778264.003.0013 ◽

2017 ◽

Author(s):

Richard Compton

Keyword(s):

Head Movement ◽

Word Formation ◽

Diagnostic Device ◽

Closed Class ◽

Variable Ordering ◽

Mirror Principle

This chapter examines polysynthetic word formation in Inuit (Eskimo-Aleut), using the presence and variable ordering of a closed class of adverbs within verbal complexes as a diagnostic device to evaluate the adequacy of different accounts of word formation. It is argued that a head movement account of Mirror Principle orders within Inuit words undergenerates with respect to the observed variation in adverb ordering, particularly if a fixed hierarchy of adverbial functional projections is assumed, as in Cinque (1999). Instead, it is shown that an analysis that employs a right-headed structure, XP-sized phasal words, and Ernst’s (2002) semantically based framework of adverb licensing better captures the observed variation.

Download Full-text

Applications of Machine Learning in Biomedical Text Processing and Food Industry

Machine Learning for Healthcare Applications ◽

10.1002/9781119792611.ch10 ◽

2021 ◽

pp. 151-167

Author(s):

K. Paramesha ◽

H.L. Gururaj ◽

Om Prakash Jena

Keyword(s):

Machine Learning ◽

Food Industry ◽

Text Processing ◽

Biomedical Text ◽

Applications Of Machine Learning

Download Full-text

GHS-NET A Generic Hybridized Shallow Neural Network for Multi-Label Biomedical Text Classification

Journal of Biomedical Informatics ◽

10.1016/j.jbi.2021.103699 ◽

2021 ◽

pp. 103699

Author(s):

Muhammad Ali Ibrahim ◽

Muhammad Usman Ghani Khan ◽

Faiza Mehmood ◽

Muhammad Nabeel Asim ◽

Waqar Mahmood

Keyword(s):

Neural Network ◽

Text Classification ◽

Biomedical Text ◽

Biomedical Text Classification

Download Full-text

Biomedical text similarity evaluation using attention mechanism and siamese neural network

IEEE Access ◽

10.1109/access.2021.3099021 ◽

2021 ◽

pp. 1-1

Author(s):

Z.G. Li ◽

H. Chen ◽

H.Y. Chen

Keyword(s):

Neural Network ◽

Attention Mechanism ◽

Biomedical Text ◽

Text Similarity

Download Full-text

Extraction of causal relations based on SBEL and BERT model

Database ◽

10.1093/database/baab005 ◽

2021 ◽

Vol 2021 ◽

Author(s):

Yifan Shao ◽

Haoru Li ◽

Jinghang Gu ◽

Longhua Qian ◽

Guodong Zhou

Keyword(s):

State Of The Art ◽

Causal Relation ◽

Relation Extraction ◽

The Other ◽

Biomedical Text ◽

Intermediate Form ◽

Biomedical Text Mining ◽

Causal Relations ◽

The One ◽

Stage 1

Abstract Extraction of causal relations between biomedical entities in the form of Biological Expression Language (BEL) poses a new challenge to the community of biomedical text mining due to the complexity of BEL statements. We propose a simplified form of BEL statements [Simplified Biological Expression Language (SBEL)] to facilitate BEL extraction and employ BERT (Bidirectional Encoder Representation from Transformers) to improve the performance of causal relation extraction (RE). On the one hand, BEL statement extraction is transformed into the extraction of an intermediate form—SBEL statement, which is then further decomposed into two subtasks: entity RE and entity function detection. On the other hand, we use a powerful pretrained BERT model to both extract entity relations and detect entity functions, aiming to improve the performance of two subtasks. Entity relations and functions are then combined into SBEL statements and finally merged into BEL statements. Experimental results on the BioCreative-V Track 4 corpus demonstrate that our method achieves the state-of-the-art performance in BEL statement extraction with F1 scores of 54.8% in Stage 2 evaluation and of 30.1% in Stage 1 evaluation, respectively. Database URL: https://github.com/grapeff/SBEL_datasets

Download Full-text

‘Almost people’: A Learner Corpus Account of L2 Use and Misuse of Non-numerical Quantification

Open Linguistics ◽

10.1515/opli-2016-0015 ◽

2016 ◽

Vol 2 (1) ◽

Author(s):

Peter Crosthwaite ◽

Lavigne L.Y. Choy ◽

Yeonsuk Bae

Keyword(s):

English Learners ◽

English Speakers ◽

L2 Proficiency ◽

L1 Transfer ◽

Learner Corpus ◽

Proficiency Level ◽

Closed Class ◽

Corpus Data ◽

Noun Number ◽

L1 English

AbstractWe present an Integrated Contrastive Model of non-numerical quantificational NPs (NNQs, i.e. ‘some people’) produced by L1 English speakers and Mandarin and Korean L2 English learners. Learner corpus data was sourced from the ICNALE (Ishikawa, 2011, 2013) across four L2 proficiency levels. An average 10% of L2 NNQs were specific to L2 varieties, including noun number mismatches (*‘many child’), omitting obligatory quantifiers after adverbs (*‘almost people’), adding unnecessary particles (*‘all of people’) and non-L1 English-like quantifier/noun agreement (*‘many water’). Significantly fewer ‘openclass’ NNQs (e.g a number of people) are produced by L2 learners, preferring ‘closed-class’ single lexical quantifiers (following L1-like use). While such production is predictable via L1 transfer, Korean L2 English learners produced significantly more L2-like NNQs at each proficiency level, which was not entirely predictable under a transfer account. We thus consider whether positive transfer of other linguistic forms (i.e. definiteness marking) aids the learnability of other L2 forms (i.e. expression of quantification).

Download Full-text

Biomedical text summarisation using concept chains

International Journal of Data Mining and Bioinformatics ◽

10.1504/ijdmb.2007.012967 ◽

2007 ◽

Vol 1 (4) ◽

pp. 389 ◽

Cited By ~ 8

Author(s):

Lawrence H. Reeve ◽

Hyoil Han ◽

Ari D. Brooks

Keyword(s):

Biomedical Text

Download Full-text

Automatic discourse connective detection in biomedical text

Journal of the American Medical Informatics Association ◽

10.1136/amiajnl-2011-000775 ◽

2012 ◽

Vol 19 (5) ◽

pp. 800-808 ◽

Cited By ~ 7

Author(s):

Balaji Polepalli Ramesh ◽

Rashmi Prasad ◽

Tim Miller ◽

Brian Harrington ◽

Hong Yu

Keyword(s):

Biomedical Text

Download Full-text

A broadened estimate of syntactic and lexical ability from the MB-CDI

Journal of Child Language ◽

10.1017/s0305000921000283 ◽

2021 ◽

pp. 1-18

Author(s):

Trevor K.M. DAY ◽

Jed T. ELISON

Keyword(s):

Significant Proportion ◽

Critical Question ◽

Function Words ◽

Subsequent Investigation ◽

Factor Analytic ◽

Closed Class ◽

Communicative Development ◽

Age Related ◽

Syntactic Development ◽

Age Related Changes

Abstract A critical question in the study of language development is to understand lexical and syntactic acquisition, which play different roles in speech to the extent it would be natural to surmise they are acquired differently. As measured through the comprehension and production of closed-class words, syntactic ability emerges at roughly the 400-word mark. However, a significant proportion of the developmental work uses a coarse combination of function and content words on the MacArthur-Bates Communicative Development Inventory (MB-CDI). Using the MB-CDI Wordbank database, we implemented a factor analytic approach to distinguish between lexical and syntactic development from the Words and Sentences (WS) form that involves both function words and the explicit categorizations. Although the Words and Gestures (WG) form did not share the factor structure, common WG/WS elements recapitulate the expected age-related changes. This parsing of the MB-CDI may prove simple, yet fruitful in subsequent investigation.

Download Full-text

Constraints on Code-Switching: Evidence from Swedish and English

Nordic Journal of Linguistics ◽

10.1017/s0332586500001414 ◽

1986 ◽

Vol 9 (1) ◽

pp. 55-82

Author(s):

Beata Schmid

Keyword(s):

Code Switching ◽

Optimal Switching ◽

Specific Point ◽

Clear Cut ◽

Clear Sense ◽

Closed Class ◽

The Matrix ◽

Category Equivalence ◽

Communicative Context

In this paper, I have shown that Joshi's (1982) framework of codeswitching constraints can largely be applied to Swedish-English code-switches. I feel qualified to conclude that Joshi's claims concerning the non-switchability of closed class items and matrix language and embedded languages are held up by the Swedish- English data. The need for corresponding categories proved to be less clear-cut than originally proposed by Woolford (1983) and others. It seems that optimal switching conditions are given if the categories, rules and metarules correspond in the two languages. Apparently, however, it is also possible to switch if the node admissibility conditions for the matrix language only are met, as was shown by code-switched sentences containing RPs. This requires that the speaker has a clear sense of which language is the host and which is embedded. Rules from the embedded language only are not acceptable. This calls for some sort of determination strategy by the parser. I found no evidence for determining Lm at any specific point in the sentence, except at the topmost S. Rather, the judgments by code-switchers that a sentence “comes from” one language seems to coincide with the fact that the resulting sentence is based on the rules from that language. Other than that, the matrix language is determined by the communicative context as a whole.The data involving RPs also seemed to indicate that RPs are not separate ategories, but are NPs, introduced by a “de-slashing” rule (Sells 1984). If they were separate categories, this would be evidence for there being no need for category equivalence. In this case, we would have to explicitly state all other cases which require category equivalence (the majority of cases), which is undesirable.

Download Full-text