Web-Based Semantic Analysis of Chinese News Video

Author(s):  
Huamin Feng ◽  
Zongqiang Pang ◽  
Kun Qiu ◽  
Guosen Song
Author(s):  
Fion S.L. Lee ◽  
Kelvin C.K. Wong ◽  
William K.W. Cheung ◽  
Cynthia F.K. Lee

This chapter describes the use of a Web-based essay critiquing system and its integration into a series of composition workshops for a group of secondary school students in Hong Kong. It begins with a review and application of the hybrid learning approach, followed by a description of latent semantic analysis, a methodology for corpus preparation. Then, the distributed computing architecture for the essay critiquing system is described. The chapter explains how the system is integrated with the writing pedagogy implemented in the workshops and how the feasibility evaluation result is derived. The positive result confirms the benefits of hybrid learning.


2019 ◽  
Vol 8 (3) ◽  
pp. 5171-5175

Text summarization plays an important role in the analysis of large data sets. It can be used in online text analysis and knowledge representation. Semantic text summarization plays a vital role in handling big data, which is very large, dynamic in nature, and heterogeneous. This paper proposes a novel model of knowledge-based semantic analysis for summarizing dynamic web-based text data with the help of an FP-tree (Frequent Pattern tree). The model does not rely on an ontology to derive the semantic representation. It consists of two phases. In the first phase, benchmark web text data in the terrorism domain is collected to construct a domain knowledge representation using the FP-tree. Preprocessing is performed to reduce the data size and handle synonyms. In the second phase, online articles and news items are collected from different sources. Then, using the domain knowledge representation, a summary of the large web-based text data is extracted.
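
As a rough illustration of the FP-tree structure named in the abstract above, the following sketch builds a frequent-pattern tree from tokenized documents. It is not the paper's implementation: the names `FPNode` and `build_fp_tree`, the sample documents, and the `min_support` threshold are all assumptions made for illustration.

```python
from collections import Counter


class FPNode:
    """One node of the FP-tree: a term, its count, and its children."""

    def __init__(self, term=None):
        self.term = term
        self.count = 0
        self.children = {}


def build_fp_tree(documents, min_support=2):
    """Build an FP-tree from a list of token lists (hypothetical helper)."""
    # Pass 1: count in how many documents each term occurs.
    support = Counter(term for doc in documents for term in set(doc))
    frequent = {t: c for t, c in support.items() if c >= min_support}

    root = FPNode()
    # Pass 2: insert each document's frequent terms, most frequent first.
    for doc in documents:
        terms = sorted(
            (t for t in set(doc) if t in frequent),
            key=lambda t: (-frequent[t], t),  # ties broken alphabetically
        )
        node = root
        for term in terms:
            node = node.children.setdefault(term, FPNode(term))
            node.count += 1
    return root, frequent


# Made-up sample data, for illustration only.
docs = [
    ["attack", "city", "police"],
    ["attack", "police", "report"],
    ["attack", "city"],
]
tree, frequent = build_fp_tree(docs, min_support=2)
```

Documents that share frequent terms share a path prefix in the tree, so the domain vocabulary is stored compactly. How the paper turns such a tree into a summary is not specified in the abstract and is not sketched here.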


Author(s):  
Maciej Eder ◽  
Maciej Piasecki ◽  
Tomasz Walkowiak

An open stylometric system based on multilevel text analysis

Stylometric techniques are usually applied to a limited number of typical tasks, such as authorship attribution, genre analysis, or gender studies. However, they could be applied to several tasks beyond this canonical set, if only stylometric tools were more accessible to users from different areas of the humanities and social sciences. This paper presents a general idea, followed by a fully functional prototype of an open stylometric system that facilitates wide use through two aspects: technical and research flexibility. The system relies on a server installation combined with a web-based user interface. This frees the user from the necessity of installing any additional software. At the same time, the system offers a variety of ways in which the input texts can be analysed: they include not only the usual lexical level, but also deep-level linguistic features. This enables a range of possible applications, from typical stylometric tasks to the semantic analysis of text documents. The internal architecture of the system relies on several well-known software packages: a collection of language tools (for text pre-processing), Stylo (for stylometric analysis) and Cluto (for text clustering). The paper presents: (1) the idea behind the system from the user's perspective; (2) the architecture of the system, with a focus on data processing; (3) features for text description; (4) the use of analytical systems such as Stylo and Cluto. The presentation is illustrated with example applications.


Author(s):  
Ramin Sabbagh ◽  
Farhad Ameri

Descriptions of the capabilities of manufacturing companies can be found in multiple locations, including company websites, legacy system databases, and ad hoc documents and spreadsheets. These capability descriptions are often written in natural language. To unlock the value of unstructured capability information and learn from it, advanced quantitative methods supported by machine learning and natural language processing techniques are needed. This research proposes a multi-step unsupervised learning methodology using K-means clustering and topic modeling techniques to build clusters of suppliers based on their capabilities, extract and organize the manufacturing capability terminology, and discover nontrivial patterns in manufacturing capability corpora. The capability data is extracted either directly from the websites of manufacturing firms or from their profiles in e-sourcing portals and directories. The feature extraction and dimensionality reduction process in this work is supported by N-gram extraction and Latent Semantic Analysis (LSA) methods. The proposed clustering method is validated experimentally on a dataset of 150 capability descriptions collected from web-based sourcing directories, such as the Thomas Net directory for manufacturing companies. The results of the experiment show that the proposed method creates supplier clusters with high accuracy.


2014 ◽  
pp. 272-281
Author(s):  
Yuriy Semchyshyn ◽  
Ivan Kulpa ◽  
Igor Kolosovskyi ◽  
Oleksandr Hrechnikov ◽  
Petro Hayda

The paper considers semantic networks, a way of representing knowledge as a set of concept nodes connected by relation edges. The history and current state of the art of semantic networks are analyzed. The experience of designing and developing a semantic network management system is described in four sections covering the following topics: data structure design, semantic analysis algorithm implementation, selection of graph visualization methods, and Web-based user interface development. The developed system is a fully operational tool for building and studying semantic networks. The system will be used for further research.
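
The representation described in the abstract above, concepts as nodes and relations as edges, can be sketched as a small adjacency structure. This is an illustrative sketch only and does not reflect the data structures of the system in the paper: the class `SemanticNetwork`, its method names, the `is_a` relation label, and the sample concepts are all assumptions.

```python
from collections import defaultdict


class SemanticNetwork:
    """Concept nodes linked by labeled, directed relation edges."""

    def __init__(self):
        # Maps a concept to a list of (relation, target concept) pairs.
        self._edges = defaultdict(list)

    def add_relation(self, source, relation, target):
        self._edges[source].append((relation, target))
        self._edges.setdefault(target, [])  # register the target as a node

    def related(self, concept, relation):
        """Return the concepts reached from `concept` via `relation`."""
        return [t for r, t in self._edges.get(concept, []) if r == relation]

    def ancestors(self, concept, relation="is_a"):
        """Follow one relation type transitively, breadth-first."""
        seen, queue = [], [concept]
        while queue:
            current = queue.pop(0)
            for parent in self.related(current, relation):
                if parent not in seen:
                    seen.append(parent)
                    queue.append(parent)
        return seen


# Made-up sample concepts, for illustration only.
net = SemanticNetwork()
net.add_relation("canary", "is_a", "bird")
net.add_relation("bird", "is_a", "animal")
net.add_relation("bird", "has", "wings")
canary_ancestors = net.ancestors("canary")
bird_parts = net.related("bird", "has")
```

Labeling each edge with its relation type lets a single graph hold several kinds of knowledge (taxonomy, part-whole, and so on) while still allowing queries restricted to one relation.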


2017 ◽  
Author(s):  
Klimis Ntalianis ◽  
Jahna Otterbacher ◽  
Nikolaos Mastorakis

2021 ◽  
Vol 14 ◽  
pp. 256-268
Author(s):  
Yanxin Chen ◽  
Qinling Jing

The corpus used in this study consists of official news texts from Chinese and foreign online media, collected and processed by the researchers. Using Voyant, a web-based text reading and analysis platform, the study identifies and analyzes the semantic differences of the lexical chunk "Chinese culture" in Chinese and foreign news stories from the semantic perspective of systemic functional grammar, with the digital humanities mode of "distant reading" as the means of semantic analysis. The study explores the implicit semantic deviation and its logical-semantic relationships between Chinese and foreign news texts.


1998 ◽  
Vol 62 (9) ◽  
pp. 671-674
Author(s):  
JF Chaves ◽  
JA Chaves ◽  
MS Lantz
