Multilingual Text Mining

Author(s):  
Peter A. Chew

The principles of text mining are fundamental to technology in everyday use. The world wide web (WWW) has in many senses driven research in text mining, and with the growth of the WWW, applications of text mining (like search engines) have by now become commonplace. In a way that was not true even less than a decade ago, it is taken for granted that the ‘needle in the haystack’ can quickly be found among large volumes of text. In most cases, however, users still expect search engines to return results in the same language as that of the query, perhaps the language best understood by the user, or the language in which text is most likely to be available. The distribution of languages on the WWW does not match the distribution of languages spoken in general by the world’s population. For example, while English is spoken by under 10% of the world’s population (Gordon 2005), it is still predominant on the WWW, accounting for perhaps two-thirds of documents. There are variety of possible reasons for this disparity, including technological inequities between different parts of the world and the fact that the WWW had its genesis in an English-speaking country. Whatever the cause for the dominance of English, the fact that two-thirds of the WWW is in one language is, in all likelihood, a major reason that the concept of multilingual text mining is still relatively new. Until recently, there simply has not been a significant and widespread need for multilingual text mining. A number of recent developments have begun to change the situation, however. Perhaps these developments can be grouped under the general rubric of ‘globalization’. They include the increasing adoption, use, and popularization of the WWW in non-Englishspeaking societies; the trend towards political integration of diverse linguistic communities (highly evident, for example, in the European Union); and a growing interest in understanding social, technological and political developments in other parts of the world. All these developments contribute to a greater demand for multilingual text processing – essentially, methods for handling, managing, and comparing documents in multiple languages, some of which may not even be known to the end user.

Author(s):  
Charles McCombie ◽  
Neil Chapman

The growing importance of finding shared solutions for countries with small arisings of long-lived wastes from nuclear power generation has been increasingly recognised over the last few years. In this paper, we take a systematic look at recent developments focusing in turn on: • international initiatives (IAEA); • regional initiatives the European Union (EU); • national positions for in 3 categories of countries: • those participating in the Arius association that was founded explicitly to promote the multinational concept; • those with specific interests in shared solutions; • those that might consider hosting a repository; • those with laws or policies requiring national disposal. Multinational concepts continue to face challenges that are more difficult than for national programmes. Nevertheless, progress to date indicates that the nuclear communities of the world may well be on the way to optimising radioactive waste management on a global scale.


Author(s):  
Ishan Sudeera Abeywardena ◽  
Chee Seng Chan ◽  
Choy Yoong Tham

<p>The open educational resources (OER) movement has gained momentum in the past few years. With this new drive towards making knowledge open and accessible, a large number of OER repositories have been established and made available online throughout the world. However, the inability of existing search engines such as Google, Yahoo!, and Bing to effectively search for useful OER which are of acceptable academic standard for teaching purposes is a major factor contributing to the slow uptake of the entire movement. As a major step towards solving this issue, this paper proposes <em>OERScout</em>, a technology framework based on text mining solutions. The objectives of our work are to (i) develop a technology framework which will parametrically measure the usefulness of an OER for a particular academic purpose based on the openness, accessibility, and relevance attributes; and (ii) provide academics with a mechanism to locate OER which are of an acceptable academic standard. From our user tests, we have identified that OERScout is a sound solution for effectively zeroing in on OER which can be readily used for teaching and learning purposes.</p>


2018 ◽  
Vol 7 (4.35) ◽  
pp. 276
Author(s):  
Mohammed Najah Mahdi ◽  
Abdul Rahim Ahmad ◽  
Roslan Ismail

The volume of information available on the World Wide Web is quite significant. This plethora of information has, to a large extent constantly challenged researchers in seeking for ways of making the information easily accessible to end users in as convenient ways as possible. Characteristically, ensuring the accessibility of this large amount of information involves three all-encompassing processes of retrieval, organization and presentation. The World Wide Web offers a platform for sharing information from large database repositories globally. However, information needs to be searched with specialized tools commonly referred to as search engines. While a number of search engines does presently exist, most of these search engines are noted for their inability to retrieve information usable to the end user. It is therefore critical for the results generated from search engines to be intelligently organized for the optimal usefulness of the information to the searcher. Exploratory web technologies is capable of filing this gap. Therefore, this paper reviews exploratory search as a mechanism for conducting result-oriented search. It also reviews the ways of evaluating the search results obtained from an exploratory search.


1935 ◽  
Vol 29 (5) ◽  
pp. 785-804 ◽  
Author(s):  
Richard Hartshorne

The border position of geography between the natural and the social sciences is fairly generally recognized. Concerned primarily with differences in the different areas of the world, geography studies both natural and cultural features. In some universities, it is included among the natural sciences, in others among the social sciences. In England and America, geographers have particularly cultivated that portion of their field which leads naturally into economics, i.e., economic geography. Much less attention has been paid to the relations with history, although various geographers and historians have studied what has variously been called historical geography or geographic history. Even less have geographers in the English-speaking countries concerned themselves with that portion of their subject which bears upon the political areas of the world. The territorial problems of the war and postwar period, however, stimulated activity in this field both in England and America, the most notable product of which is Bowman's The New World, consisting in large part of the materials gathered for the American Commission to the Peace Conference.


Author(s):  
Bill D. Herman

The volume of information on the Internet is incomprehensible and growing exponentially. With such a vast ocean of information available, search engines have become an indispensible tool for virtually all users. Yet much of what is available online is potentially objectionable, controversial, or harmful. This leaves search engines in a potentially precarious position, simultaneously wanting to maximize the usefulness of results for end users while also minimizing political, regulatory, civil, and even criminal difficulties in the jurisdictions where they operate. Conversely, the substantial logistical and legal obstacles to regulating Internet content also leave policymakers in an unenviable position, and content that the public or policymakers may well want regulated—even that which is patently illegal—can remain virtually impossible to stamp out. The policies that may affect online search are incredibly varied, including contract law, laws that affect expression and media producers more generally, copyright, fraud, privacy, and antitrust. For the most part, the law that applies was developed in and will still apply to offline contexts as well. Internet search is still an area filled with its own vexing policy questions. In many cases, these are questions of secondary liability—addressing whether the search provider is liable for search results that link to websites that are beyond their control. In other areas, though, the behavior of search providers will endure specific scrutiny. While many of these questions could be or actually are asked in countries around the world, this article focuses primarily on the legal regimes in the United States and the European Union.


Author(s):  
Hamza Çeştepe ◽  
Tamer Güven

In this study, the structure and level of intra-regional trade in Economic Cooperation Organization (ECO), which is a movement of regional cooperation founded in the Western and Central Asia region, has been analyzed. In ECO, with a share below its potential at the world trade, intra-regional trade is low relative to other regional integrations such as the European Union. However, the volume of intra-regional trade in the region countries, except a few countries, tend to increase in recent years. In this study, as a result of the evaluation made by the indices calculated, it was found that the region countries is in the position of more complementary economies in context of foreign trade; the countries has generally a high trade intensity with its neighbors; intra-industry trade in the region is at low level. As a result, although the level and structure of intra-regional trade in ECO region seems to be disincentive for the transformation to regional integration as of today, recent developments suggest that this obstacle will gradually diminish in the future. In addition, if some of advantages and potential of the region in terms of regional integration can be valued it does not seem very difficult to reach more advanced stages of integration for this cooperation movement.


2019 ◽  
pp. 3-6
Author(s):  
D. A. Bogdanova

The article provides an overview of the activities of the European Union Forum on kids' safety in Internet — Safer Internet Forum (SIF) 2019, which was held in Brussels, Belgium, in November 2019. The current Internet risks addressed by the World Wide Web users, especially children, are described.


2018 ◽  
pp. 113-119
Author(s):  
Gennady Ya. Vagin ◽  
Eugene B. Solntsev ◽  
Oleg Yu. Malafeev

The article analyses critera applying to the choice of energy efficient high quality light sources and luminaires, which are used in Russian domestic and international practice. It is found that national standards GOST P 54993–2012 and GOST P 54992– 2012 contain outdated criteria for determining indices and classes of energy efficiency of light sources and luminaires. They are taken from the 1998 EU Directive #98/11/EU “Electric lamps”, in which LED light sources and discharge lamps of high intensity were not included. A new Regulation of the European Union #874/2012/EU on energy labelling of electric lamps and luminaires, in which these light sources are taken into consideration, contains a new technique of determining classes of energy efficiency and new, higher classes are added. The article has carried out a comparison of calculations of the energy efficiency classes in accordance with GOST P 54993 and with Regulation #874/2012/EU, and it is found out that a calculation using GOST P 54993 gives underrated energy efficiency classes. This can lead to interdiction of export for certain light sources and luminaires, can discredit Russian domestic manufacturer light sources and does not correspond to the rules of the World Trade Organization (WTO).


2003 ◽  
Vol 2 (1) ◽  
pp. 33-50
Author(s):  
Göran Gunner

Authors from the Christian Right in the USA situate the September 11 attack on New York and Washington within God's intentions to bring America into the divine schedule for the end of the world. This is true of Pat Robertson and Jerry Falwell, and other leading figures in the ‘Christian Coalition’. This article analyses how Christian fundamentalists assess the roles of the USA, the State of Israel, Islam, Iraq, the European Union and Russia within what they perceive to be the divine plan for the future of the world, especially against the background of ‘9/11’. It argues that the ideas of the Christian Right and of President George W. Bush coalesce to a high degree. Whereas before 9/11 many American mega-church preachers had aspirations to direct political life, after the events of that day the President assumes some of the roles of a mega-religious leader.


Sign in / Sign up

Export Citation Format

Share Document