Information ◽  
2018 ◽  
Vol 9 (7) ◽  
pp. 171
Author(s):  
Alexandru Telea ◽  
Andreas Kerren

Recent developments at the crossroads of data science, datamining,machine learning, and graphics and imaging sciences have further established information visualization and visual analytics as central disciplines that deliver methods, techniques, and tools for making sense of and extracting actionable insights and results fromlarge amounts of complex,multidimensional, hybrid, and time-dependent data.[...]


2020 ◽  
Author(s):  
Johanna Schmidt

The need to use data visualization and visual analysis in various fields has led to the development of feature-rich standalone applications such as Tableau and MS Power BI. These applications provide ready-to-use functionality for loading, analyzing and visualizing data, even for users who are not familiar with programming and scripting. Meanwhile, data scientists have to combine many different tools and techniques in their daily work, since no standalone application can yet cover the entire workflow. As a result, a rich landscape of open source libraries is available today, covering various tasks from data analysis to modeling and visualization. To combine the best of two worlds, interfaces for scripting languages have been integrated into standalone applications in recent years. We analyzed which interfaces to six common scripting languages are offered. The interfaces offer different levels of integration and therefore support different steps of the data science workflow. In this paper we investigated the integration levels of script languages in standalone applications and divided them into four groups. We used this classification to evaluate 13 standalone visual analysis applications currently available on the market. We then analyzed which groups of applications best support which steps in the data science workflow. We found that a tight integration of scripting languages can especially support the explorative analysis and modeling phase of the data science workflow. We also discuss our results in the light of visual analysis research and give suggestions for future research directions.


2021 ◽  
Author(s):  
Stefan Van Aelst ◽  
Patrick J. F. Groenen

The Journal of Data Science, Statistics, and Visualisation (JDSSV) is an electronic journal which welcomes contributions to data science, statistics, and visualisation, and in particular, those aspects which link and integrate these subject areas. Articles can cover topics such as machine learning and statistical learning, the visualisation and verbalisation of data, visual analytics, big data infrastructures and analytics, interactive learning, and advanced computing. Articles thatdiscuss two or more research areas of the journal are favoured. Scientific contributions should be of a high standard. Articles should be oriented towards a wide scientific audience of statisticians, data scientists, computer scientists, data analysts, etc. The journal welcomes original contributions that are not being considered for publication elsewhere and contain a high level of novelty. Articles with a thorough but concise review of a certain topic with the potential to provide new insights are also welcome. Manuscripts submitted to the journal generally are accompanied by supplementary material containing software code, data, technical derivations or detailed explanations, additional examples, etc. All submitted material will be reviewed by the assigned associate editor and reviewers of the manuscript.


2021 ◽  
Author(s):  
Bogi Haryo Nugroho ◽  
Brahmantiyo Aji Sumarto ◽  
Muhammad Arief Joenaedy ◽  
Huda Jassim Al-Aradi ◽  
Pajar Rahman Achmad

Abstract Objective/scope It has been a challenge to analyze and estimate reliable water cut. The current well test data is not sufficient to satisfy the required information for prediction of the rate and water cut behaviors. Only on wells having stable and good behaviors, water cut levels can be estimated appropriately. The wells have Electrical Submersible Pump (ESP) sensor reading and data acquisition recorded in real-time help to fill this gap. The data are stored and available in KOC data repositories, such as Corporate Database, Well Surveillance Management System (WSMS), and Artificial Lift Management System (ALMS) Engineers spend this effort in spreadsheets and working with multiple data repositories. It is fit for data analysis by combining the data into a simple data set and presentation. Nevertheless, spreadsheets do not address a number of important tasks in a typical analyst's pipeline, and their design frequently complicates the analyses. It may take hours for single well analysis and days for multi-wells analysis and could be too late to plan and take preventive actions. Concerning the above situation, collaboration has been performed between NFD-North Kuwait and Information Management Team. In this first phase, this initiative is to design a conceptual integrated preventive system, which provide easy and quick tool to compute water cut estimation from well tests and downhole sensors data by using data science approach. Method, procedure, process There are 5 steps were applied in this initial work. It was included but not limited to user interview, exercise and performed data dissemination. It included gather full knowledge and defining the goal. Mapping pain points to solution also conducted to identify the technical challenge and find ways to overcome them. In the end of this stage, data and process review was conducted and applied for a given simple example to understand the requirements, demonstrate technical functionality and verify technical feasibility. Then conceptual design was built based on the requirements, features, and solutions gathered. Integrated system solution was recommended to include intermediate layer for integration, data retrieval, running calculation-heavy process in background, model optimization, visual analytics, decision-making, and automation. A roadmap with complete planning of different phases is then provided to achieve the objective. Results, observations, conclusions Process, functionalities, requirements, and finding have been examined and elaborated. The conceptual design has proved and assured the utilization of ESP sensor data in helping to estimate continuous well water cut's behavior. Further, the next implementation phase of data science expects an increase of confidence level of the results into higher degree. The design is promising to achieve the requirement to provide seamless, scalable, and easy to deploy automation capability tools for data analytic workflow with several major business benefits arising. Proposed solution includes combination of technologies, implementation services, and project management. The proposed technology components are distributed into 3 layers, source data, data science layer, and visual analytics layer. Furthermore, a roadmap of the project along with the recommendation for each phase has also been included. Novel/additive information Data Science for Exploration and Production is new area in which research and development will be required. Data science driven approach and application of digital transformation enables an integrated preventive system providing solution to compute water cut estimation from well tests and downhole sensors data. In the next larger scale of implementation, this system is expected to provide automated workflow supporting engineers in their daily tasks leveraging Data to Decision (D2D) approach. Machine learning is a data analytics technique that teaches computers to do what comes naturally to human, which is learn from experience. Machine learning algorithm use computational methods to learn information from the data without relying on predetermined equation as a model. Adding artificial intelligence and machine learning capability into the process requires knowledge on input data, the impact of data on the output, understanding of machine learning algorithm and building the model required to meet the expected output.


2020 ◽  
pp. 65-70
Author(s):  
S. Arndt ◽  
M. Begoin ◽  
M. Runnwerth

The German National Library for Science and Technology (TIB) seizes the opportunity of an epochal change into the Digital Age, inter alia, by maintaining a prestigious research department covering the areas data science & digital libraries, visual analytics, scientific data management, knowledge infrastructures, learning & skill analytics, open science, and non-textual media. Without neglecting the original mission of collecting and curating literature for a widespread access to scientific information, TIB merges well-established processes with intelligent assistance tools. The Specialised Information Service for Mobility and Traffic Science (FID move) is one example of combining the mentioned research areas in order to build a user-centred subject-specific research infrastructure to support and shape tomorrow’s scientific work. We give a detailed introduction to the project’s action fields: web service platform, information supply with a focus on open access, strategy & structure for reusable research data, research community exchange & networking, communication strategies for the public & for scientists. Exemplary, we present the ongoing activities in building a comprehensive knowledge organisation system for e-mobility.


Author(s):  
Charles Bouveyron ◽  
Gilles Celeux ◽  
T. Brendan Murphy ◽  
Adrian E. Raftery

Sign in / Sign up

Export Citation Format

Share Document