Extrinsic Evaluation of Cross-Lingual Embeddings on the Patent Classification Task

Author(s):  
Anastasiia Ryzhova ◽  
Ilya Sochenkov
2021 ◽  
Author(s):  
Arousha Haghighian Roudsari ◽  
Jafar Afshar ◽  
Wookey Lee ◽  
Suan Lee

AbstractPatent classification is an expensive and time-consuming task that has conventionally been performed by domain experts. However, the increase in the number of filed patents and the complexity of the documents make the classification task challenging. The text used in patent documents is not always written in a way to efficiently convey knowledge. Moreover, patent classification is a multi-label classification task with a large number of labels, which makes the problem even more complicated. Hence, automating this expensive and laborious task is essential for assisting domain experts in managing patent documents, facilitating reliable search, retrieval, and further patent analysis tasks. Transfer learning and pre-trained language models have recently achieved state-of-the-art results in many Natural Language Processing tasks. In this work, we focus on investigating the effect of fine-tuning the pre-trained language models, namely, BERT, XLNet, RoBERTa, and ELECTRA, for the essential task of multi-label patent classification. We compare these models with the baseline deep-learning approaches used for patent classification. We use various word embeddings to enhance the performance of the baseline models. The publicly available USPTO-2M patent classification benchmark and M-patent datasets are used for conducting experiments. We conclude that fine-tuning the pre-trained language models on the patent text improves the multi-label patent classification performance. Our findings indicate that XLNet performs the best and achieves a new state-of-the-art classification performance with respect to precision, recall, F1 measure, as well as coverage error, and LRAP.


2012 ◽  
Author(s):  
Xin Liu ◽  
Xiaobin Zhou ◽  
Jianjun Zhu ◽  
Jing-Jen Wang

2015 ◽  
Author(s):  
Qiang Chen ◽  
Wenjie Li ◽  
Yu Lei ◽  
Xule Liu ◽  
Yanxiang He

Author(s):  
Xiaodan Zhuang ◽  
Arnab Ghoshal ◽  
Antti-Veikko Rosti ◽  
Matthias Paulik ◽  
Daben Liu

2019 ◽  
Vol 951 (9) ◽  
pp. 25-39
Author(s):  
V.V. Zabavnikov ◽  
A.N. Kobiakov ◽  
S.V. Kovalev

Informational and analytical studying patent documentation shows the patenting situation either in general in a specific technological area or the patent activity of innovation entities, taking temporal dynamics and the territorial basis into account. Patent-information investigation was carried out in order to get acquainted with the level of photogrammetry technology development and determine its current application areas. Statistical and intellectual patent document text analysis was the basis for relevant data array grouped in 8680 patent families’ creation. The prepared report contains a graphical display of selected patent documents array, related to research topic, analytical and statistical processing. The level of inventive activity was assessed; the world patenting dynamics and location in this technical field were considered. The main groups on the International Patent Classification, as well as the main technological directions, where technical solutions related to the object of study to be patented, are identified. Information on the leading applicants/ patent holders in this technical field is provided; the list of the most cited patent documents is considered.


Sign in / Sign up

Export Citation Format

Share Document