A Topic Detection Method Based on Word-attention Networks

Abstract Purpose We proposed a method to represent scientific papers by a complex network, which combines the approaches of neural and complex networks. Design/methodology/approach Its novelty is representing a paper by a word branch, which carries the sequential structure of words in sentences. The branches are generated by the attention mechanism in deep learning models. We connected those branches at the positions of their common words to generate networks, called word-attention networks, and then detect their communities, defined as topics. Findings Those detected topics can carry the sequential structure of words in sentences, represent the intra- and inter-sentential dependencies among words, and reveal the roles of words playing in them by network indexes. Research limitations The parameter setting of our method may depend on practical data. Thus it needs human experience to find proper settings. Practical implications Our method is applied to the papers of the PNAS, where the discipline designations provided by authors are used as the golden labels of papers’ topics. Originality/value This empirical study shows that the proposed method outperforms the Latent Dirichlet Allocation and is more stable.

Download Full-text

Identifying Scientific and Technical “Unicorns”

Journal of Data and Information Science ◽

10.2478/jdis-2021-0002 ◽

2020 ◽

Vol 0 (0) ◽

Author(s):

Lucy L. Xu ◽

Miao Qi ◽

Fred Y. Ye

Keyword(s):

Linear Model ◽

Design Methodology ◽

Science And Technology ◽

High Impact ◽

Future Studies ◽

Scientific Papers ◽

Practical Implications ◽

Very High ◽

Quantitative Consideration

AbstractPurposeUsing the metaphor of “unicorn,” we identify the scientific papers and technical patents characterized by the informetric feature of very high citations in the first ten years after publishing, which may provide a new pattern to understand very high impact works in science and technology.Design/methodology/approachWhen we set CT as the total citations of papers or patents in the first ten years after publication, with CT≥ 5,000 for scientific “unicorn” and CT≥ 500 for technical “unicorn,” we have an absolute standard for identifying scientific and technical “unicorn” publications.FindingsWe identify 165 scientific “unicorns” in 14,301,875 WoS papers and 224 technical “unicorns” in 13,728,950 DII patents during 2001–2012. About 50% of “unicorns” belong to biomedicine, in which selected cases are individually discussed. The rare “unicorns” increase following linear model, the fitting data show 95% confidence with the RMSE of scientific “unicorn” is 0.2127 while the RMSE of technical “unicorn” is 0.0923.Research limitationsA “unicorn” is a pure quantitative consideration without concerning its quality, and “potential unicorns” as CT≤5,000 for papers and CT≤500 for patents are left in future studies.Practical implicationsScientific and technical “unicorns” provide a new pattern to understand high-impact works in science and technology. The “unicorn” pattern supplies a concise approach to identify very high-impact scientific papers and technical patents.Originality/valueThe “unicorn” pattern supplies a concise approach to identify very high impact scientific papers and technical patents.

Download Full-text

Emerging Research Topic Detection Using Filtered-LDA

AI ◽

10.3390/ai2040035 ◽

2021 ◽

Vol 2 (4) ◽

pp. 578-599

Author(s):

Fuad Alattar ◽

Khaled Shaalan

Keyword(s):

Final Stage ◽

Topic Modeling ◽

Latent Dirichlet Allocation ◽

Research Topic ◽

Main Topic ◽

Topic Detection ◽

Scientific Papers ◽

Communities of co-commenting in the Russian LiveJournal and their topical coherence

Internet Research ◽

10.1108/intr-03-2014-0079 ◽

2016 ◽

Vol 26 (3) ◽

pp. 710-732 ◽

Cited By ~ 5

Author(s):

Olessia Koltsova ◽

Sergei Koltcov ◽

Sergey Nikolenko

Keyword(s):

Social Studies ◽

Topic Modeling ◽

Design Methodology ◽

Latent Dirichlet Allocation ◽

Opinion Leaders ◽

Reliability Of Results ◽

Policy Makers ◽

Content Type ◽

The Social ◽

Practical Implications

Purpose – The paper addresses the problem of what drives the formation of latent discussion communities, if any, in the blogosphere: topical composition of posts or their authorship? The purpose of this paper is to contribute to the knowledge about structure of co-commenting. Design/methodology/approach – The research is based on a dataset of 17,386 full text posts written by top 2,000 LiveJournal bloggers and over 520,000 comments that result in about 4.5 million edges in the network of co-commenting, where posts are vertices. The Louvain algorithm is used to detect communities of co-commenting. Cosine similarity and topic modeling based on latent Dirichlet allocation are applied to study topical coherence within these communities. Findings – Bloggers unite into moderately manifest communities by commenting roughly the same sets of posts. The graph of co-commenting is sparse and connected by a minority of active non-top commenters. Communities are centered mainly around blog authors as opinion leaders and, to a lesser extent, around a shared topic or topics. Research limitations/implications – The research has to be replicated on other datasets with more thorough hand coding to ensure the reliability of results and to reveal average proportions of topic-centered communities. Practical implications – Knowledge about factors around which co-commenting communities emerge, in particular clustered opinion leaders that often attract such communities, can be used by policy makers in marketing and/or political campaigning when individual leadership is not enough or not applicable. Originality/value – The research contributes to the social studies of online communities. It is the first study of communities based on co-commenting that combines examination of the content of commented posts and their topics.

Download Full-text

Topic Detection Based on Weak Tie Analysis: A Case Study of LIS Research

Journal of Data and Information Science ◽

10.20309/jdis.201626 ◽

2017 ◽

Vol 1 (4) ◽

pp. 81-101

Author(s):

Ling Wei ◽

Haiyun Xu ◽

Zhenmeng Wang ◽

Kun Dong ◽

Chao Wang ◽

...

Keyword(s):

High Frequency ◽

Design Methodology ◽

Weak Ties ◽

Topic Detection ◽

Independent Research ◽

Research Topics ◽

Strong Ties ◽

Parameter Values ◽

Practical Implications

AbstractPurposeBased on the weak tie theory, this paper proposes a series of connection indicators of weak tie subnets and weak tie nodes to detect research topics, recognize their connections, and understand their evolution.Design/methodology/approachFirst, keywords are extracted from article titles and preprocessed. Second, high-frequency keywords are selected to generate weak tie co-occurrence networks. By removing the internal lines of clustered sub-topic networks, we focus on the analysis of weak tie subnets’ composition and functions and the weak tie nodes’ roles.FindingsThe research topics’ clusters and themes changed yearly; the subnets clustered with technique-related and methodology-related topics have been the core, important subnets for years; while close subnets are highly independent, research topics are generally concentrated and most topics are application-related; the roles and functions of nodes and weak ties are diversified.Research limitationsThe parameter values are somewhat inconsistent; the weak tie subnets and nodes are classified based on empirical observations, and the conclusions are not verified or compared to other methods.Practical implicationsThe research is valuable for detecting important research topics as well as their roles, interrelations, and evolution trends.Originality/valueTo contribute to the strength of weak tie theory, the research translates weak and strong ties concepts to co-occurrence strength, and analyzes weak ties’ functions. Also, the research proposes a quantitative method to classify and measure the topics’ clusters and nodes.

Download Full-text

Automatic stepping for circumferential splice drilling in aircraft fuselage assembly

Industrial Robot the international journal of robotics research and application ◽

10.1108/ir-06-2015-0114 ◽

2016 ◽

Vol 43 (2) ◽

pp. 144-152

Author(s):

Weidong Zhu ◽

Along Zhang ◽

Biao Mei ◽

Yinglin Ke

Keyword(s):

Design Methodology ◽

Detection Method ◽

High Accuracy ◽

Drilling Machine ◽

End Effector ◽

Content Type ◽

Aircraft Fuselage ◽

Position Detection ◽

Suppression Method ◽

Practical Implications

Purpose – A large number of fastener holes have to be drilled with high quality in the circumferential splice region during the assembly of aircraft fuselage. The purpose of this paper is to design an automatic stepping mechanism for a circumferential splice drilling machine, to meet the requirements of large workspace and high accuracy in drilling at the same time. Design/methodology/approach – A docking position detection method based on magnetic proximity sensors is proposed for the positioning of the arc-shaped rail with respect to the circumferential rails, which significantly improves the accuracy and reliability of automatic stepping. The slipping phenomenon of the end-effector is analyzed, and the optimized counter weights are used to eliminate the slipping and improve the working stability of the stepping mechanism. Findings – An automatic stepping mechanism is developed for the circumferential splice drilling machine, which comprises the docking position detection method and the elimination/suppression method of the end-effector’s slipping. Practical implications – The proposed automatic stepping mechanism has been integrated into the circumferential splice drilling machine for the fuselage assembly in an aircraft company in China. Originality/value – An automatic stepping scheme for the circumferential splice drilling machine is proposed, which enhances the efficiency in circumferential splice drilling in aircraft fuselage assembly.

Download Full-text

Scientific Value Weights more than Being Open or Toll Access: An analysis of the OA advantage in Nature and Science

Journal of Data and Information Science ◽

10.2478/jdis-2021-0033 ◽

2021 ◽

Vol 0 (0) ◽

Author(s):

Howell Y. Wang ◽

Shelia X. Wei ◽

Cong Cao ◽

Xianwen Wang ◽

Fred Y. Ye

Keyword(s):

Design Methodology ◽

Latent Dirichlet Allocation ◽

Web Of Science ◽

Pearson Correlation ◽

High Quality ◽

Qualitative Comparison ◽

Scientific Value ◽

Practical Implications ◽

The Web

Abstract Purpose We attempt to find out whether OA or TA really affects the dissemination of scientific discoveries. Design/methodology/approach We design the indicators, hot-degree, and R-index to indicate a topic OA or TA advantages. First, according to the OA classification of the Web of Science (WoS), we collect data from the WoS by downloading OA and TA articles, letters, and reviews published in Nature and Science during 2010–2019. These papers are divided into three broad disciplines, namely biomedicine, physics, and others. Then, taking a discipline in a journal and using the classical Latent Dirichlet Allocation (LDA) to cluster 100 topics of OA and TA papers respectively, we apply the Pearson correlation coefficient to match the topics of OA and TA, and calculate the hot-degree and R-index of every OA-TA topic pair. Finally, characteristics of the discipline can be presented. In qualitative comparison, we choose some high-quality papers which belong to Nature remarkable papers or Science breakthroughs, and analyze the relations between OA/TA and citation numbers. Findings The result shows that OA hot-degree in biomedicine is significantly greater than that of TA, but significantly less than that of TA in physics. Based on the R-index, it is found that OA advantages exist in biomedicine and TA advantages do in physics. Therefore, the dissemination of average scientific discoveries in all fields is not necessarily affected by OA or TA. However, OA promotes the spread of important scientific discoveries in high-quality papers. Research limitations We lost some citations by ignoring other open sources such as arXiv and bioArxiv. Another limitation came from that Nature employs some strong measures for access-promoting subscription-based articles, on which the boundary between OA and TA became fuzzy. Practical implications It is useful to select hot topics in a set of publications by the hot-degree index. The finding comprehensively reflects the differences of OA and TA in different disciplines, which is a useful reference when researchers choose the publishing way as OA or TA. Originality/value We propose a new method, including two indicators, to explore and measure OA or TA advantages.

Download Full-text

Consumer value creation through WhatsApp use

Academia Revista Latinoamericana de Administración ◽

10.1108/arla-02-2019-0044 ◽

2019 ◽

Vol 32 (4) ◽

pp. 455-471

Author(s):

Jorge Cruz-Cárdenas ◽

Jorge Guadalupe-Lanas ◽

Ekaterina Zabelina ◽

Andrés Palacio-Fierro ◽

Margarita Velín-Fárez ◽

...

Keyword(s):

Latin American ◽

Value Creation ◽

Emotional Support ◽

Design Methodology ◽

Instant Messaging ◽

Role Performance ◽

Content Type ◽

Consumer Value ◽

Depth Interviews ◽

Practical Implications

Purpose The purpose of this paper is to understand in-depth how consumers create value in their lives using WhatsApp, the leading mobile instant messaging (MIM) application. Design/methodology/approach The study adopts the perspective of customer-dominant logic (CDL) and uses a qualitative multimethod design involving 3 focus groups and 25 subsequent in-depth interviews. The research setting was Ecuador, a Latin American country. Findings Analysis and interpretation of the participants’ stories made it possible to identify and understand the creation of four types of value: maintaining and strengthening relationships; improving role performance; emotional support; and entertainment and fun. In addition, the present study proposes a conceptual model of consumer value creation as it applies to MIM. Practical implications Understanding the way consumers create value in their lives using MIM is important not only for organizations that offer MIM applications, but also for those companies that develop other applications for mobile phones or for those who wish to use MIM as an electronic word-of-mouth vehicle. Originality/value The current study is one of the first to address the topic of consumer behavior in the use of technologies from the perspective of CDL; this perspective enables an integrated qualitative vision of value creation in which the consumer is the protagonist.

Download Full-text

Advantages and potential challenges of data management in e-maintenance

Journal of Quality in Maintenance Engineering ◽

10.1108/jqme-03-2018-0018 ◽

2019 ◽

Vol 25 (3) ◽

pp. 378-396 ◽

Cited By ~ 3

Author(s):

Arian Razmi-Farooji ◽

Hanna Kropsu-Vehkaperä ◽

Janne Härkönen ◽

Harri Haapasalo

Keyword(s):

Data Management ◽

Design Methodology ◽

Management Practices ◽

Future Research ◽

Conceptual Approach ◽

Content Type ◽

Different Types ◽

Maintenance Systems ◽

Industry Leader ◽

Practical Implications

Purpose The purpose of this paper is twofold: first, to understand data management challenges in e-maintenance systems from a holistically viewpoint through summarizing the earlier scattered research in the field, and second, to present a conceptual approach for addressing these challenges in practice. Design/methodology/approach The study is realized as a combination of a literature review and by the means of analyzing the practices on an industry leader in manufacturing and maintenance services. Findings This research provides a general understanding over data management challenges in e-maintenance and summarizes their associated proposed solutions. In addition, this paper lists and exemplifies different types and sources of data which can be collected in e-maintenance, across different organizational levels. Analyzing the data management practices of an e-maintenance industry leader provides a conceptual approach to address identified challenges in practice. Research limitations/implications Since this paper is based on studying the practices of a single company, it might be limited to generalize the results. Future research topics can focus on each of mentioned data management challenges and also validate the applicability of presented model in other companies and industries. Practical implications Understanding the e-maintenance-related challenges helps maintenance managers and other involved stakeholders in e-maintenance systems to better solve the challenges. Originality/value The so-far literature on e-maintenance has been studied with narrow focus to data and data management in e-maintenance appears as one of the less studied topics in the literature. This research paper contributes to e-maintenance by highlighting the deficiencies of the discussion surrounding the perspectives of data management in e-maintenance by studying all common data management challenges and listing different types of data which need to be acquired in e-maintenance systems.

Download Full-text

An exploratory empirical study of whistleblowing and whistleblowers

Journal of Financial Crime ◽

10.1108/jfc-03-2020-0042 ◽

2020 ◽

Vol 27 (3) ◽

pp. 755-770

Author(s):

Maria Krambia-Kapardis

Keyword(s):

Statistical Analysis ◽

Design Methodology ◽

Research Topic ◽

Due Date ◽

Content Type ◽

Unethical Behaviour ◽

Eu Member States ◽

European Directive ◽

Original Survey ◽

Practical Implications

Purpose The purpose of this study is to develop a profile of whistleblowers and to determine whether whistleblowing legislation would encourage those individuals to bring to light some illegal or unethical behaviour that otherwise would remain in the shadows. Design/methodology/approach Having identified whistleblowing correlation, a survey was carried out in Cyprus of actual whistleblowers and could-have-been whistleblowers. Findings Males between 46 and55 years of age, regardless of whether they have dependents or hold senior positions in organizations are significantly more likely to blow the whistle. However, could-have-been whistleblowers did not go ahead because they felt that the authorities would not act on their information. Research limitations/implications Because of the sensitive nature of the research topic and the fact that only whistleblowers or intended whistleblowers could participate in the study, the sample size is limited as a result. This, in turn, limits both the number of respondents in each category (actual and intended) as well as constrains the statistical analysis that could be carried out on the data. Practical implications It remains to be seen whether EU Member States shall implement the European Directive 2019/1937 on the protection of persons who report breaches of Union Law, in its entirety by the due date, namely December 2021. Originality/value This study provides a literature review of whistleblowing and reports an original survey against the backdrop of the European Directive.

Download Full-text

How Phillips saw the light

Strategic Direction ◽

10.1108/sd-05-2020-0103 ◽

2020 ◽

Vol 36 (8) ◽

pp. 29-31

Keyword(s):

Design Methodology ◽

Reading Time ◽

Daily Basis ◽

Business World ◽

Content Type ◽

Freak Show ◽

The World ◽

Pertinent Information ◽

The One ◽

Practical Implications

Purpose Reviews the latest management developments across the globe and pinpoints practical implications from cutting-edge research and case studies. Design/methodology/approach This briefing is prepared by an independent writer who adds their own impartial comments and places the articles in context. Findings The problem with developing a reputation of being something of an oracle in the business world is that all of a sudden, everyone expects you to pull off the trick of interpreting the future on a daily basis. Like a freak show circus act or one-hit wonder pop singer, people expect you to perform when they see you, and they expect you to perform the thing that made you famous, even if it is the one thing in the world you don’t want to do. And when you fail to deliver on these heightened expectations, you are dismissed as a one trick pony, however good that trick is in the first place. Originality/value The briefing saves busy executives and researchers hours of reading time by selecting only the very best, most pertinent information and presenting it in a condensed and easy-to-digest format.

Download Full-text