Extraction of Web Content Based on Content Type

Author(s):  
Manish Kumar Verma ◽  
Sarowar Kumar ◽  
Kumar Abhishek ◽  
M. P. Singh
Keyword(s):  
2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Lorelei Ortiz

PurposeThis study examines comprehensiveness and responsiveness of mission statements for the top 100 retailers on the 2020 National Retailers Federation list in order to (1) evaluate how effectively they communicate organizational identity, values and purpose, (2) underscore a distinctive commitment to stakeholders and (3) what extent these efforts are reflected in revised mission statements or addenda to meet global pandemic challenges.Design/methodology/approachThe study employs a 4-question metric to measure comprehensiveness and a two-pronged qualitative method of analysis consisting of keyword searches followed by content analysis.FindingsRetailer statements are considerably comprehensive in describing purpose and audience yet very few articulate stakeholder value, differentiate themselves as distinctive or substantively reaffirm their core mission and values. Retailers seem more invested in strategic communication around diversity, equity and inclusion, based on web content in their consumer, job seeker and investor touchpoints.Research limitations/implicationsCoding and interpreting language through content analysis methods may introduce some level of subjectivity, particularly when dealing with unstructured data. Implications for how organizations acclimated in order to survive and thrive, while maintaining focus on stakeholders and strategy. Examining organizational mission statements and their contexts yields perspective into how organizations define themselves and what they do during times of crisis.Originality/valueThis study provides insights into the content, structure and functions of the statements against a specific comprehensiveness metric and reveals patterns about the texts and their contexts during a pandemic and strong cultural and societal movements.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Martin Lněnička ◽  
Renata Machova ◽  
Jolana Volejníková ◽  
Veronika Linhartová ◽  
Radka Knezackova ◽  
...  

PurposeThe purpose of this paper was to draw on evidence from computer-mediated transparency and examine the argument that open government data and national data infrastructures represented by open data portals can help in enhancing transparency by providing various relevant features and capabilities for stakeholders' interactions.Design/methodology/approachThe developed methodology consisted of a two-step strategy to investigate research questions. First, a web content analysis was conducted to identify the most common features and capabilities provided by existing national open data portals. The second step involved performing the Delphi process by surveying domain experts to measure the diversity of their opinions on this topic.FindingsIdentified features and capabilities were classified into categories and ranked according to their importance. By formalizing these feature-related transparency mechanisms through which stakeholders work with data sets we provided recommendations on how to incorporate them into designing and developing open data portals.Social implicationsThe creation of appropriate open data portals aims to fulfil the principles of open government and enables stakeholders to effectively engage in the policy and decision-making processes.Originality/valueBy analyzing existing national open data portals and validating the feature-related transparency mechanisms, this paper fills this gap in existing literature on designing and developing open data portals for transparency efforts.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Noha A. Nagy ◽  
Amira S.N. Tawadros ◽  
Amal S. Soliman

Purpose This paper aims at understanding the dynamics underlying toleration as a complex social phenomenon and its pattern on Facebook during the June 30th revolution in Egypt. Thanks to the huge advances in ICT, internet-mediated research (IMR) has become one of the most prominent research methodologies in social sciences. Discussions on social network sites cannot be neglected in studying the dynamics complex and emerging social phenomena such as changes in public opinion, culture, attitudes and virtues. Design/methodology/approach To fulfill this aim, the researchers used web content analysis as a method inside IMR paradigm to analyze the discussions on Tamarrod’s Facebook page in the period from June 30th to July 5th and to examine the emerging overall pattern of toleration. Findings The results show indications that toleration is inherent in the Egyptian culture, and that the Egyptian society still keeps its reputation as a highly tolerant society, even in crises periods where tensions are witnessed everywhere. Moreover, the results also show that the web content analysis process proposed in this study is highly reliable and valid. Originality/value The importance of the study lies in introducing a computational and empirical approach to analyze web content in a semi-automated way and proving its validity and reliability to study social phenomena such as toleration.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Gennaro Maione ◽  
Daniela Sorrentino ◽  
Alba Demneri Kruja

Purpose At exceptional times, governments are entrusted with greater authority. This creates significant concerns over governments’ transparency and accountability. This paper aims to pursue a twofold objective: assessing the patterns of open government data during the extraordinary time initiated by the COVID-19 pandemic drawing relevant policy and managerial implications regarding the future development of open data as a mechanism of accountability at times of exception. Design/methodology/approach The study follows exploratory research, relying on a web content analysis. The empirical setting is provided by 20 Italian regional governments during the COVID-19 pandemic as a shock that has triggered an exceptional time for governments. Findings Results on the desirable (extrinsic and intrinsic) characteristics of the data analyzed show that in the empirical setting investigated, open data does not enable to properly address the accountability concerns of a demanding forum at times of exception. Research limitations/implications The paper enriches the state of the art on accountability and provides both scholars and practitioners (e.g. policymakers, managers, etc.) a current reading of data-driven orientation as a stimulus to the accountability of public administrations during exceptional times. Originality/value The paper investigates open data as a condition of public accountability, assessing whether and how Italian regional governments have concretely opened their data to enable their forums to elaboration of an informed opinion about their conduct during the ongoing pandemic. This fosters the understanding of how accountability is deployed in times of exception in light of the possibilities offered by the availability of online platforms.


2020 ◽  
Vol 21 (3) ◽  
pp. 121-145
Author(s):  
Subhajit Panda ◽  
Rupak Chakravarty

PurposeThe purpose of this paper is to investigate and identify the status of Web Content Accessibility Guidelines (WCAG) conformance levels (A, AA, AAA) and accessibility status in terms of Severity (Error, Warning and Review) and Responsibility (Editor, Webmaster and Developer) of Indian Institutes of Technology (IIT) Library websites based on Siteimprove Software-as-a-Service (SaaS) platform.Design/methodology/approachThe library websites of IITs were tested using Siteimprove web-tool to gather details pertaining to W3C's WCAG 2.1 standards. The data thus obtained were then visualized using spreadsheet software for greater insight. A partial correlation test was also done to assess the relationship between the three conformance levels.FindingsThe study could identify significant accessibility-related limitations of the IIT library websites concerning the three WCAG 2.1 conformance levels A (max IIT Bombay), AA (max IIT Dhanbad (ISM)) and AAA (max IIT Gandhinagar and IIT Varanasi (BHU)), Severity and Responsibility. A positive linear relationship exists amongst these conformance levels. The mean value of conformance levels were found to be 18.3 (A), 2.2 (AA) and 3.1 (AAA); Severity scores were found to be 14.4 (Error), 3.9 (Warning) and 5.2 (Review); and Responsibility scores were found to be 6 (Editor), 9.3 (Webmaster) and 8.3 (Developer), respectively.Practical implicationsThe study highlights the comparative picture of accessibility issues and conformance levels of the IITs' library website homepage with the help of results derived/based on Siteimprove Accessibility Checker. The findings of the study reveal that though the library website of IITs' in India possess a well-designed and easily navigable website homepage as far as their accessibility for VIPs is concerned, there are several issues that are still to be resolved.Social implicationsWorld Intellectual Property Organization's (WIPO) Marrakesh VIP Treaty (MVT) and the W3C's WCAG cater to the requirements and rights of the persons with vision-related disability of accessing information and knowledge building a steeper and deeper knowledge divide. Identifying and rectifying the shortcomings in the library websites will bridge the accessibility-divide and make the society more inclusive.Originality/valueNo previous study could be identified evaluating the accessibility issues of the library website of Indian IITs focussed on vision-disabled persons using Siteimprove. The methodology and approach of this paper have value in terms of reusability and reproducibility facilitating future studies.


2020 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Chunmin Lang ◽  
Sibei Xia ◽  
Chuanlan Liu

PurposeThis study intends to examine consumers' fashion customization experiences through a web content mining (WCM) approach. By applying the theory of customer value, this study explores the benefits and costs of two levels of mass customization (MC) to identify the values derived from style (i.e. shoe customization) and fit customization experiences (i.e. apparel customization) and further to compare the dominating dimensions of value derived across style and fit customization.Design/methodology/approachA WCM approach was applied. Also, two case studies were conducted with one focusing on style customization and the other focusing on fit customization. The brand Vans was selected to examine style customization in study 1. The brand Sumissura was selected to examine fit customization in study 2. Consumers' comments on customization experiences from these two brands were collected through social networks, respectively. After data cleaning, 394 reviews for Vans and 510 reviews for Sumissura were included in the final data analysis. Co-occurrence plots, feature extraction and grouping were used for the data analysis.FindingsThe emotional value was found to be the major benefit for style customization, while the functional value was indicated as the major benefit for fit customization, followed by ease of use and emotional value. In addition, three major themes of costs, including unsatisfied service, disappointing product performance and financial risk, were revealed by excavating and evaluating consumers' feedback of their actual clothing customization experiences with Sumissura.Originality/valueThis study initiates the effort to use web mining, specifically, the WCM approach to thoroughly investigate the benefits and costs of MC through real consumers' feedback of two different types of fashion products. The analysis of this study also reflects the levels of customization: style and fit. It provides an in-depth text analysis of online MC consumers' feedback through the use of feature extraction analysis and word co-occurrence networks.


2015 ◽  
Vol 33 (1) ◽  
pp. 35-51 ◽  
Author(s):  
Anusha Lakmini Wijayaratne ◽  
Diljit Singh

Purpose – The purpose of this paper is to introduce a library website model. Further, the paper discusses a designer’s checklist and an evaluative instrument that were constructed based on the proposed model. Design/methodology/approach – The model was developed through a Delphi study that was participated by two panels of experts. The researcher communicated with the panel members via e-mail using two Delphi instruments designed out of two item pools that were developed based on the knowledge gained from surveying the literature, visiting the selected libraries and exploring the library websites. Then, a designer’s checklist and an evaluative instrument were derived from the proposed model through a series of brainstorming sessions. Findings – The proposed model consisted of altogether 140 items (60 web content elements and 80 web design features). The designer’s checklist comprises all 140 items, and the evaluative instrument comprises 60 content elements and 57 design features. Research limitations/implications – This study has developed an academic library website model and derived two instruments based on the proposed model. Further studies are needed to customize, particularly, the web content pillar of this conceptual model, to meet the specific needs of different types of libraries including public libraries, special libraries, school libraries, etc. Practical implications – The designer’s checklist and the evaluative instrument derived from the proposed model are useful tools for library professionals in designing, re-designing, maintaining and evaluating their library websites. The librarians may use these tools for both institutional and research purposes. Originality/value – The model and the two instruments proposed by this study are unique in focus, origin, content and presentation.


2015 ◽  
Vol 33 (4) ◽  
pp. 526-544 ◽  
Author(s):  
Doralyn Rossmann ◽  
Scott W.H. Young

Purpose – Social Media Optimization (SMO) offers guidelines by which libraries can design content for social shareability through social networking services (SNSs). The purpose of this paper is to introduce SMO and discuss its effects and benefits for libraries. Design/methodology/approach – Researchers identified and applied five principles of SMO. Web analytics software provides data on web site traffic and user engagement before and after the application of SMO. Findings – By intentionally applying a program of SMO, the library increased content shareability, increased user engagement, and built community. Research limitations/implications – Increasing use of SNSs may influence the study results, independent of SMO application. Limitations inherent to web analytics software may affect results. Further study could expand analysis beyond web analytics to include comments on SNS posts, SNS shares from library pages, and a qualitative analysis of user behaviors and attitudes regarding library web content and SNSs. Practical implications – This research offers an intentional approach for libraries to optimize their online resources sharing through SNSs. Originality/value – Previous research has examined the role of community building and social connectedness for SNS users, but none have discussed using SMO to encourage user engagement and interactivity through increased SNS traffic into library web pages.


2014 ◽  
Vol 27 (2) ◽  
pp. 208-228 ◽  
Author(s):  
Faouzi Kamoun ◽  
Mohamed Basel Almourad

Purpose – The purpose of this paper is to examine the extent to which accessibility is taken into account in the assessment and ranking of e-government web sites through the lens of a specific study related to Dubai e-government. Design/methodology/approach – The paper considers a case study related to Dubai e-government and it evaluates the accessibility of each of the 21 Dubai e-government web sites, based on the Web Content Accessibility Guidelines (WCAG) 2.0 and using an automated accessibility testing tool. A bivariate correlation analysis is performed to assess the correlation between web site ranking and accessibility score. Findings – The research reveals that contrary to common intuition and some earlier studies, there is a weak correlation between e-government web site ranking score and web site accessibility. Research limitations/implications – The paper uses an accessibility metric that is a proxy indicator of web accessibility and is not a real assessment of accessibility as experienced by a person with disability. Practical implications – When re-examined through the lens of Rawls's moral theory, this research suggests that accessibility should be given a higher priority in the general evaluation and ranking of e-government web sites. Social implications – The paper promotes universal accessibility to e-government information and services. Originality/value – The paper uses ethical arguments to highlight the need to comprehensively consider accessibility as a major criterion in the assessment and ranking of e-government web sites.


2016 ◽  
Vol 26 (4) ◽  
pp. 901-918 ◽  
Author(s):  
Mirza Muhammad Naseer ◽  
Khalid Mahmood

Purpose – The purpose of this paper is to explore the use of political party websites for e-electioneering and their impact on the outcome of the elections. Design/methodology/approach – Empirical data for the study were collected from the websites of 11 major political parties of Pakistan using modified version of the coding scheme used by Gibson, Rommele and Ward for the evaluation of functionality and delivery of websites. Data were analysed using web content analysis method to achieve the objectives of this study. The study also ranked the party websites based on points scored for functionalities and delivery. Findings – The study found that although Pakistani political parties have started using their websites for communication with their voters during the general elections but they have not utilized the full potential of the website functionalities for e-electioneering. Research limitations/implications – The study focused on content analysis of political party websites of Pakistan only. However, comparisons were made to other studies where possible to contextualize the results of this study in international perspective. It is suggested to replicate this study after ten years to study the changing behaviour of political parties. Practical implications – Political parties might like to improve their websites in the light of findings of this study to spread their message more effectively to larger voter base. Social implications – Findings of the study will help in improving the readiness of political parties for e-electioneering and improved websites will help voters in making an informed decision during election. It will overall improve the electoral process in the country where democratic system is not very strong. Originality/value – With the advent of internet, political parties are using their websites during elections for various purposes. This study, first ever in Pakistan on the topic, provides empirical evidence on the use of political party websites during May 2013 general election in Pakistan and presents its impact on the outcome of the election. The study will be valuable for political science researchers especially those focusing on Asia and Pakistan.


Sign in / Sign up

Export Citation Format

Share Document