scholarly journals Interactive Data Analytics for the Humanities

Author(s):  
Iryna Gurevych ◽  
Christian M. Meyer ◽  
Carsten Binnig ◽  
Johannes Fürnkranz ◽  
Kristian Kersting ◽  
...  
2018 ◽  
Vol 9 (3) ◽  
pp. 383-395 ◽  
Author(s):  
Jianlong Zhou ◽  
Syed Z. Arshad ◽  
Xiuying Wang ◽  
Zhidong Li ◽  
Dagan Feng ◽  
...  

Author(s):  
Johan L Perols ◽  
Ann C Dzuranin

Accounting firms are making significant investments in audit data analytics technologies to modernize their audit services and the audit profession is believed to now be on the verge of a transformation (BDO 2016; Deloitte 2016; EY 2015; Forbes Insights 2015; PwC 2015). In particular, the firms are emphasizing newer technologies such as interactive data visualization (BDO 2016; Deloitte 2016; PwC 2016) and they are increasingly expecting students to have data analytics skills (Forbes Insights 2015; PwC 2015). In this case you take on the role of Bryan, an audit senior assigned to Acme. Brian has been tasked with using interactive data visualization to gain an understanding of Acme’s sales and perform an initial evaluation of two fraud risks identified during a fraud brainstorming session.  Brian has been given a data file with over 250,000 financial transactions and five master tables that he is supposed to analyze using interactive data visualization.


2019 ◽  
Vol 214 ◽  
pp. 07020
Author(s):  
Diogo Castro ◽  
Prasanth Kothuri ◽  
Piotr Mrowczynski ◽  
Danilo Piparo ◽  
Enric Tejedor

This talk is about sharing our recent experiences in providing data analytics platform based on Apache Spark for High Energy Physics, CERN accelerator logging system and infrastructure monitoring. The Hadoop Service has started to expand its user base for researchers who want to perform analysis with big data technologies. Among many frameworks, Apache Spark is currently getting the most traction from various user communities and new ways to deploy Spark such as Apache Mesos or Spark on Kubernetes have started to evolve rapidly. Meanwhile, notebook web applications such as Jupyter offer the ability to perform interactive data analytics and visualizations without the need to install additional software. CERN already provides a web platform, called SWAN (Service for Web-based ANalysis), where users can write and run their analyses in the form of notebooks, seamlessly accessing the data and software they need. The first part of the presentation talks about several recent integrations and optimizations to the Apache Spark computing platform to enable HEP data processing and CERN accelerator logging system analytics. The optimizations and integrations, include, but not limited to, access of kerberized resources, xrootd connector enabling remote access to EOS storage and integration with SWAN for interactive data analysis, thus forming a truly Unified Analytics Platform. The second part of the talk touches upon the evolution of the Apache Spark data analytics platform, particularly sharing the recent work done to run Spark on Kubernetes on the virtualized and container-based infrastructure in Openstack. This deployment model allows for elastic scaling of data analytics workloads enabling efficient, on-demand utilization of resources in private or public clouds.


Author(s):  
Dorota Glowacka ◽  
Evalgelos Milios ◽  
Axel J. Soto ◽  
Fernando V. Paulovich ◽  
Dennis Parra ◽  
...  

Author(s):  
Dorota Glowacka ◽  
Evangelos Milios ◽  
Axel J Soto ◽  
Osnat Mokryn ◽  
Fernando V Paulovich ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document