Graph analytics and visualization for cyber situational understanding

This paper describes the Cyber Situational Understanding (Cyber SU) Proof of Concept (CySUP) software system for exploring advanced Cyber SU capabilities. CySUP distills complex interrelationships among cyberspace entities to provide the “so what” of cyber events for tactical operations. It combines a variety of software components to build an end-to-end pipeline for live data ingest that populates a graph knowledge base, with query-driven exploratory analysis and interactive visualizations. CySUP integrates with the core infrastructure environment supporting command posts to provide a cyber overlay onto a common operating picture oriented to tactical commanders. It also supports detailed analysis of cyberspace entities and relationships driven by ad hoc graph queries, including the conversion of natural language inquiries to formal query language. To help assess its Cyber SU capabilities, CySUP leverages automated cyber adversary emulation to carry out controlled cyberattack campaigns that impact elements of tactical missions.

Download Full-text

A Natural Language Processing Approach to Measuring Treatment Adherence and Consistency Using Semantic Similarity

AERA Open ◽

10.1177/23328584211028615 ◽

2021 ◽

Vol 7 ◽

pp. 233285842110286

Author(s):

Kylie L. Anglin ◽

Vivian C. Wong ◽

Arielle Boguslav

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Semantic Similarity ◽

Language Processing ◽

Intervention Implementation ◽

Proof Of Concept ◽

Coaching Intervention ◽

Processing Techniques ◽

Teacher Coaching ◽

The Impact

Though there is widespread recognition of the importance of implementation research, evaluators often face intense logistical, budgetary, and methodological challenges in their efforts to assess intervention implementation in the field. This article proposes a set of natural language processing techniques called semantic similarity as an innovative and scalable method of measuring implementation constructs. Semantic similarity methods are an automated approach to quantifying the similarity between texts. By applying semantic similarity to transcripts of intervention sessions, researchers can use the method to determine whether an intervention was delivered with adherence to a structured protocol, and the extent to which an intervention was replicated with consistency across sessions, sites, and studies. This article provides an overview of semantic similarity methods, describes their application within the context of educational evaluations, and provides a proof of concept using an experimental study of the impact of a standardized teacher coaching intervention.

Download Full-text

Array databases: concepts, standards, implementations

Journal Of Big Data ◽

10.1186/s40537-020-00399-2 ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Peter Baumann ◽

Dimitar Misev ◽

Vlad Merticariu ◽

Bang Pham Huu

Keyword(s):

Service Quality ◽

Ad Hoc ◽

Query Language ◽

Distributed Processing ◽

Database Systems ◽

Database Technology ◽

Comprehensive Survey ◽

Spatio Temporal ◽

And Performance ◽

Array Databases

AbstractMulti-dimensional arrays (also known as raster data or gridded data) play a key role in many, if not all science and engineering domains where they typically represent spatio-temporal sensor, image, simulation output, or statistics “datacubes”. As classic database technology does not support arrays adequately, such data today are maintained mostly in silo solutions, with architectures that tend to erode and not keep up with the increasing requirements on performance and service quality. Array Database systems attempt to close this gap by providing declarative query support for flexible ad-hoc analytics on large n-D arrays, similar to what SQL offers on set-oriented data, XQuery on hierarchical data, and SPARQL and CIPHER on graph data. Today, Petascale Array Database installations exist, employing massive parallelism and distributed processing. Hence, questions arise about technology and standards available, usability, and overall maturity. Several papers have compared models and formalisms, and benchmarks have been undertaken as well, typically comparing two systems against each other. While each of these represent valuable research to the best of our knowledge there is no comprehensive survey combining model, query language, architecture, and practical usability, and performance aspects. The size of this comparison differentiates our study as well with 19 systems compared, four benchmarked to an extent and depth clearly exceeding previous papers in the field; for example, subsetting tests were designed in a way that systems cannot be tuned to specifically these queries. It is hoped that this gives a representative overview to all who want to immerse into the field as well as a clear guidance to those who need to choose the best suited datacube tool for their application. This article presents results of the Research Data Alliance (RDA) Array Database Assessment Working Group (ADA:WG), a subgroup of the Big Data Interest Group. It has elicited the state of the art in Array Databases, technically supported by IEEE GRSS and CODATA Germany, to answer the question: how can data scientists and engineers benefit from Array Database technology? As it turns out, Array Databases can offer significant advantages in terms of flexibility, functionality, extensibility, as well as performance and scalability—in total, the database approach of offering “datacubes” analysis-ready heralds a new level of service quality. Investigation shows that there is a lively ecosystem of technology with increasing uptake, and proven array analytics standards are in place. Consequently, such approaches have to be considered a serious option for datacube services in science, engineering and beyond. Tools, though, vary greatly in functionality and performance as it turns out.

Download Full-text

Adolescent impulsiveness and use of alcohol and tobacco

European Journal of Investigation in Health, Psychology and Education ◽

10.3390/ejihpe5030033 ◽

2015 ◽

Vol 5 (3) ◽

pp. 371-382

Author(s):

Mª del Carmen Pérez-Fuentes ◽

José J. Gázquez ◽

Mª del Mar Molero ◽

Fernando Cardila ◽

África Martos ◽

...

Keyword(s):

Risk Factor ◽

Drug Use ◽

Detailed Analysis ◽

Intervention Program ◽

Ad Hoc ◽

Demographic Characteristics ◽

The State ◽

Negative Consequences ◽

Frequency Of Use ◽

The Relationship

Adolescence is characterized by premature experimentation with new experiences and sensations. These experiences sometimes include drugs, which even though legal and socially accepted, begin to have noticeable negative consequences to the adolescent’s development. In recent years, a decrease in use of tobacco by Spanish adolescents has been observed, but not in alcohol. One of the causes of initiation in drug use is impulsive personality or behavior. Thus the purpose of this study was to analyze the relationship between impulsiveness and frequency of use of alcohol and tobacco in 822 students aged 13 to 18 years of age. The State Impulsivity Scale (SIS) and an ad hoc questionnaire on demographic characteristics and use of alcohol and tobacco were used for this. The results showed that students who stated they were users scored significantly higher on impulsivity. Thus detailed analysis of the profile of individuals with this risk factor could favor more adequate intervention program design.

Download Full-text

Robust simulation of mineral precipitation–dissolution problems with variable mineral surface area

Journal of Engineering Mathematics ◽

10.1007/s10665-021-10132-4 ◽

2021 ◽

Vol 129 (1) ◽

Author(s):

Serge Kräutle ◽

Jan Hodai ◽

Peter Knabner

Keyword(s):

Surface Area ◽

Ad Hoc ◽

Chemical Species ◽

Special Focus ◽

Proof Of Concept ◽

Mineral Surface ◽

Mineral Precipitation ◽

Numerical Tests ◽

Ill Posed ◽

Mineral Surface Area

AbstractWe consider a macroscale model of transport and reaction of chemical species in a porous medium with a special focus on mineral precipitation–dissolution processes. In the literature, it is frequently proposed that the reaction rate should depend on the reactive mineral surface area, and so on the amount of mineral. We point out that a frequently used model is ill posed in the sense that it admits non-unique solutions. We investigate what consequences this non-uniqueness has on the numerical solution of the model. The main novelty in this article is our proposal of a certain substitution which removes the ill-posedness from the system and which leads to better numerical results than some “ad hoc methods.” We think that the proposed substitution is a rather elegant way to get rid of the non-uniqueness and the numerical difficulties and is much less technical than other ideas. As a proof of concept, we present some numerical tests and simulations for the new model.

Download Full-text

Traffic Congestion Reduction and Accident Circumvention System via Incorporation of CAV and VANET

International Journal of Ambient Computing and Intelligence ◽

10.4018/ijaci.2021010103 ◽

2021 ◽

Vol 12 (1) ◽

pp. 53-72

Author(s):

Mohsin Khan ◽

Bhavna Arora

Keyword(s):

Real Time ◽

Data Transmission ◽

Traffic Congestion ◽

Ad Hoc Network ◽

Discrete System ◽

Ad Hoc ◽

New Age ◽

The Real ◽

Automated Vehicle ◽

The Core

Connected automated vehicle (CAV) technology is the core for the new age vehicles in research phase to communicate with one another and assimilation of vehicular ad-hoc network (VANET) for the transference of data between vehicles at a quantified place and time. This manuscript is an enactment of the algorithms associated to the maintenance of secure distance amongst vehicles, lane shifting, and overtaking, which will diminish the occurrence of collisions and congestions especially phantom jams. Those implementations are centered over CAV and VANET technology for the interconnection of the vehicles and the data transmission. The data is associated to the aspects of a vehicle such as speed, position, acceleration, and acknowledgements, which acts as the fundamentals for the computation of variables. In accordance with the environment of a particular vehicle (i.e., its surrounding vehicles), real-time decisions are taken based on the real-time computation of the variables in a discrete system.

Download Full-text

Composing Questions through Conceptual Authoring

Computational Linguistics ◽

10.1162/coli.2007.33.1.105 ◽

2007 ◽

Vol 33 (1) ◽

pp. 105-133 ◽

Cited By ~ 26

Author(s):

Catalina Hallett ◽

Donia Scott ◽

Richard Power

Keyword(s):

Natural Language ◽

Question Answering ◽

Free Text ◽

Risk Averse ◽

Proof Of Concept ◽

Concept System ◽

Complex Queries ◽

Extensive Training ◽

Question Answering Systems ◽

Medical Histories

This article describes a method for composing fluent and complex natural language questions, while avoiding the standard pitfalls of free text queries. The method, based on Conceptual Authoring, is targeted at question-answering systems where reliability and transparency are critical, and where users cannot be expected to undergo extensive training in question composition. This scenario is found in most corporate domains, especially in applications that are risk-averse. We present a proof-of-concept system we have developed: a question-answering interface to a large repository of medical histories in the area of cancer. We show that the method allows users to successfully and reliably compose complex queries with minimal training.

Download Full-text

Human Annotated Dialogues Dataset for Natural Conversational Agents

Applied Sciences ◽

10.3390/app10030762 ◽

2020 ◽

Vol 10 (3) ◽

pp. 762

Author(s):

Erinc Merdivan ◽

Deepika Singh ◽

Sten Hanke ◽

Johannes Kropf ◽

Andreas Holzinger ◽

...

Keyword(s):

Natural Language ◽

Detailed Analysis ◽

Human Perception ◽

Natural Language Understanding ◽

Industrial Applications ◽

Benchmark Dataset ◽

Conversational Agents ◽

High Quality ◽

Language Understanding ◽

Major Drawback

Conversational agents are gaining huge popularity in industrial applications such as digital assistants, chatbots, and particularly systems for natural language understanding (NLU). However, a major drawback is the unavailability of a common metric to evaluate the replies against human judgement for conversational agents. In this paper, we develop a benchmark dataset with human annotations and diverse replies that can be used to develop such metric for conversational agents. The paper introduces a high-quality human annotated movie dialogue dataset, HUMOD, that is developed from the Cornell movie dialogues dataset. This new dataset comprises 28,500 human responses from 9500 multi-turn dialogue history-reply pairs. Human responses include: (i) ratings of the dialogue reply in relevance to the dialogue history; and (ii) unique dialogue replies for each dialogue history from the users. Such unique dialogue replies enable researchers in evaluating their models against six unique human responses for each given history. Detailed analysis on how dialogues are structured and human perception on dialogue score in comparison with existing models are also presented.

Download Full-text

A Query Language for Workflow Logs

ACM Transactions on Management Information Systems ◽

10.1145/3482968 ◽

2022 ◽

Vol 13 (2) ◽

pp. 1-28

Author(s):

Yan Tang ◽

Weilong Cui ◽

Jianwen Su

Keyword(s):

Business Process ◽

Evaluation Method ◽

Ad Hoc ◽

Query Language ◽

Cost Model ◽

Formal Semantics ◽

Control Flow ◽

Query Evaluation ◽

Evaluation Algorithm ◽

Laws And Policies

A business process (workflow) is an assembly of tasks to accomplish a business goal. Real-world workflow models often demanded to change due to new laws and policies, changes in the environment, and so on. To understand the inner workings of a business process to facilitate changes, workflow logs have the potential to enable inspecting, monitoring, diagnosing, analyzing, and improving the design of a complex workflow. Querying workflow logs, however, is still mostly an ad hoc practice by workflow managers. In this article, we focus on the problem of querying workflow log concerning both control flow and dataflow properties. We develop a query language based on “incident patterns” to allow the user to directly query workflow logs instead of having to transform such queries into database operations. We provide the formal semantics and a query evaluation algorithm of our language. By deriving an accurate cost model, we develop an optimization mechanism to accelerate query evaluation. Our experiment results demonstrate the effectiveness of the optimization and achieves up to 50× speedup over an adaption of existing evaluation method.

Download Full-text

AODV-UI Proof of Concept on MIPS-based Wireless Router

Journal of Communications Software and Systems ◽

10.24138/jcomss.v10i1.136 ◽

2014 ◽

Vol 10 (1) ◽

pp. 14

Author(s):

B. Anantasatya Adhi ◽

Ruki Harwahyu ◽

Abdusy Syarif ◽

Harris Simaremare ◽

R. Fitri Sari ◽

...

Keyword(s):

Routing Protocol ◽

Network Performance ◽

Ad Hoc ◽

Delivery Ratio ◽

Proof Of Concept ◽

Mobile Nodes ◽

Media Access ◽

Wireless Router ◽

Hardware Configuration ◽

Aodv Routing Protocol

AODV routing protocol facilitates changing and simple-to-setup network environment. It helps setting up a network without sufficient infrastructure, such as in disaster area. Development of AODV protocol has gathered a worldwide research interest. However, not many researches implement AODV routing protocol in real mobile nodes and real MANET. In addition, real implementation deals with other works concerning underlying protocol, firmware and hardware configuration, as well as detailed topology both in logical and physical arrangement. This work aims to implements Ad-hoc On-demand Distant Vector – particularly University of Indonesia AODV (AODV-UI) routing protocol on low-end inexpensive generic wireless routers as a proof of concept. AODV-UI is an improved version of AODV routing protocol that implements gateway interconnection and reverse route capability. This routing protocol has been previously successfully tested in NS-2. In this work, current AODV-UI protocol is ported to OpenWRT + MIPS (Microprocessor without Interlocked Pipeline Stages) little endian architecture then tested on the real networking environment. Underlying media access layer is also altered to provide the protocol greater control over the network. Performance of this implementation is measured in terms of energy consumption, routing overhead, end-to-end delay, protocol reliability and packet delivery ratio.

Download Full-text

Προσαρμοστική επιδημική διάδοση σε ασύρματα αδόμητα δίκτυα

10.12681/eadd/45651 ◽

2018 ◽

Author(s):

Θεοφάνης Κοντός

Keyword(s):

Quality Of Service ◽

Ad Hoc ◽

Proof Of Concept ◽

One Stage ◽

Look Ahead ◽

Cross Layering

Το αντικείμενο της έρευνας αυτής είναι η επιδημική διάδοση (epidemic dissemination)πληροφορίας σε ασύρματα αδόμητα (ad hoc) δίκτυα με προσαρμοστικό τρόπο. Στό-χοι είναι η συμφιλίωση των απαιτήσεων για ευρεία διάδοση της πληροφορίας και γιαμειωμένο ενεργειακό κόστος, η εκτίμηση μιας βέλτιστης λύσης του προβλήματος αλ-λά και η επίτευξη της βέλτιστης απόδοσης στους εν λόγω τομείς με υψηλής ποιότηταςπληροφορία.Η προσαρμοστική επιδημική διάδοση (Π.Ε.Δ.) σε αδόμητα δίκτυα σε θορυβώδες περι-βάλλον είναι πρόβλημα με πολλές παραμέτρους.Με τη χρήση της διαστρωμάτωσης (cross-layering) με βάση την επίγνωση κατάστασηςκαναλιού μπορούν να κατασκευαστούν σχήματα προσαρμοστικής διάδοσης, τα οποίαεξοικονομούν ενέργεια διατηρώντας ταχεία και αποτελεσματική διάχυση πληροφορίαςή μόλυνση του δικτύου, σύμφωνα με την ορολογία της επιδημικής διάδοσης [1].Αυτού του είδους η αντιμετώπιση [1] είναι αναδραστική. Μπορεί όμως να σχεδιαστείμια εκδοχή σχημάτων με πρόβλεψη η οποία χρησιμοποιεί συναρτήσεις οφέλους για τηνπροσαρμογή των χαρακτηριστικών εκπομπής. Η προτεινόμενη λύση είναι μια απόδει-ξη της αρχής λειτουργίας (proof of concept, PoC) αυτής της ιδέας. Με την αντιμετώπισηαυτή εξοικονομείται ενέργεια ενώ και η διάχυση πληροφορίας είναι επιτυχής [2].Το πιο πάνω σχήμα μπορεί να ενισχυθεί με τη χρήση βελτιστοποίησης. Ελέγχονταςποια προσαρμογή είναι η πλέον σύμφορη σε κάθε χρονική στιγμή όπως στο [2], επι-τυγχάνεται μια έστω και «μυωπική» ορατότητα στο άμεσο μέλλον. Με τη χρήση τηςθεωρίας βέλτιστης παύσης (β.π.) μπορεί να αποκτηθεί καλύτερη ορατότητα σε βάθοςχρόνου και να προσεγγιστεί η βελτιστοποίηση. Η πραγματική βελτιστοποίηση εξαρτά-ται από την ικανοποίηση συνθηκών β.π. Αν αυτές δεν ικανοποιούνται πλήρως, τότεκαι οι αποκτώμενες λύσεις δεν είναι κατ’ ανάγκη βέλτιστες. Αυτό φαίνεται στην [3] μετη χρήση του κανόνα β.π. (one-stage-look-ahead, 1sla). Μπορεί να επιτευχθεί βελτί-ωση σε σχέση με μη προσαρμοστικές λύσεις ως προς την εξοικονόμηση ενέργειας. Ηευρεία μόλυνση του δικτύου εξακολουθεί να επιτυγχάνεται [3].Εξετάζεται στη συνέχεια ως πρόβλημα Π.Ε.Δ. ο χρονοπρογραμματισμός τακτικών εκ-πομπών. Η βελτιστοποίηση προσεγγίζεται με τη μετάβαση από αυστηρά περιοδικέςεκπομπές σε πλησιοπεριοδικό καθεστώς. Ο χρονοπρογραμματισμός πραγματοποιεί-ται με τη βοήθεια των ιδίων εργαλείων, δηλαδή διαστρωματικές συναρτήσεις οφέλουςκαι χρήση μηχανισμού β.π. Το πρόβλημα αυτό μοντελοποιείται ως κλασικό πρόβλημαγραμματέως, στο οποίο η χρήση β.π. προσφέρει βελτιστοποίηση. Σε σχέση με σχή-ματα χωρίς β.π. [2] και με μη προσαρμοστικά το σχήμα αυτό είναι αποδοτικότερο στηνεξοικονόμηση ενέργειας και επιτρέπει εξ ίσου επιτυχημένη μόλυνση του δικτύου. Είναιδυνατόν να συνυπολογιστεί και το ενεργειακό κόστος απόκτησης πληροφορίας κατά-στασης καναλιού (ΠΚΚ). Με τη βοήθεια αυτού του σχήματος, το δίκτυο συγκλίνει σεκατάσταση στην οποία το κόστος διάχυσης είναι σημαντικά ελαττωμένο ενώ σημαντικόποσοστό του εναπομένοντος κόστους οφείλεται στην ανάκτηση ΠΚΚ ([4]). Οι ακριβείςσυσχετισμοί εξαρτώνται από τις παραμέτρους του προβλήματος [4].Περαιτέρω θίγεται η δυνατότητα βελτίωσης της ποιότητας πληροφορίας με την αξιο-ποίηση σχημάτων σαν αυτό της [4]. Αρχικά θεωρείται ως ποιότητα η νεότητα της δια-χεόμενης πληροφορίας. Προτείνεται προσαρμογή των χαρακτηριστικών εκπομπής μεβάση τη νεότητα της μολύνουσας πληροφορίας με στόχο τη βελτίωση της μέσης ηλικί-ας της μολύνουσας τους κόμβους πληροφορίας. Με τον τρόπο αυτό εισάγεται η ιδέατης προσαρμογής των χαρακτηριστικών εκπομπής με βάση το περιεχόμενο της ίδιαςτης φερόμενης πληροφορίας.Ο μετριασμός των ευρυεκπομπών κατά τη φάση ανακάλυψης μονοπατιού στα πρω-τόκολλα δρομολόγησης είναι ένας ακόμη ελκυστικός στόχος και αντιμετωπίζεται σταπλαίσια του πρωτοκόλλου AODV. Εδώ αναδεικνύεται ο ανταγωνισμός μεταξύ εξοικο-νόμησης ενεργειακού κόστους και ταχείας εύρεσης πληροφορίας δρομολόγησης. Σεμια απόπειρα επέκτασης της τεχνικής στην εκπομπή της ίδιας της δρομολογούμενηςπληροφορίας (των δεδομένων δηλαδή), μεταπίπτουμε σε περιβάλλον μονοεκπομπών(unicast). Εδώ, ο μετριασμός εκπομπών μπορεί να έχει σοβαρές συνέπειες στη μετά-δοση πληροφορίας και καταδεικνύεται η ανάγκη εξέτασης του προβλήματος με τη βο-ήθεια μετρικών που εκφράζουν την έννοια της ποιότητας υπηρεσίας (quality of service,QoS) και κατ’ επέκταση της ποιότητας της πληροφορίας.Το εξεταζόμενο πρόβλημα χρονοπρογραμματισμού με την πρόσθετη νέα απαίτησηςτης βελτίωσης της ποιότητας πληροφορίας μπορεί να διατυπωθεί αυστηρά ως πρό-βλημα βέλτιστης παύσης με πεπερασμένο γνωστό ορίζοντα.Η ποιότητα πληροφορίας μπορεί να ορίζεται με βάση ιδιότητές της (όπως η νεότηταπου αναφέρθηκε) ή παραμέτρους του πρωτοκόλλου στα πλαίσια του οποίου γίνεται ηδιάχυσή της, όπως το TTL (time-to-live) στην περίπτωση του AODV.Έτσι η ποιότητα πληροφορίας αναδεικνύεται σε μια τρίτη συνιστώσα πέρα από το ε-νεργειακό κόστος και το ποσοστό μόλυνσης, η οποία λαμβάνει μέρος στον υπό εξέτασηαναδυόμενο ανταγωνισμό. Ανάλογα με την οριζόμενη ποιότητα πληροφορίας είναι δυ-νατή η εύρεση συμβιβασμού μεταξύ των τριών αυτών απαιτήσεων.

Download Full-text