benchmarking performance Latest Research Papers

AbstractDeep saliency models represent the current state-of-the-art for predicting where humans look in real-world scenes. However, for deep saliency models to inform cognitive theories of attention, we need to know how deep saliency models prioritize different scene features to predict where people look. Here we open the black box of three prominent deep saliency models (MSI-Net, DeepGaze II, and SAM-ResNet) using an approach that models the association between attention, deep saliency model output, and low-, mid-, and high-level scene features. Specifically, we measured the association between each deep saliency model and low-level image saliency, mid-level contour symmetry and junctions, and high-level meaning by applying a mixed effects modeling approach to a large eye movement dataset. We found that all three deep saliency models were most strongly associated with high-level and low-level features, but exhibited qualitatively different feature weightings and interaction patterns. These findings suggest that prominent deep saliency models are primarily learning image features associated with high-level scene meaning and low-level image saliency and highlight the importance of moving beyond simply benchmarking performance.

Download Full-text

Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/537 ◽

2021 ◽

Author(s):

Maxime Peyrard ◽

Beatriz Borges ◽

Kristina Gligorić ◽

Robert West

Keyword(s):

Natural Language Processing ◽

Language Processing ◽

Test Sentence ◽

High Accuracy ◽

The Other ◽

Grand Challenge ◽

Training Time ◽

Minimal Pairs ◽

Different Sources ◽

Benchmarking Performance

The automatic detection of humor poses a grand challenge for natural language processing. Transformer-based systems have recently achieved remarkable results on this task, but they usually (1) were evaluated in setups where serious vs humorous texts came from entirely different sources, and (2) focused on benchmarking performance without providing insights into how the models work. We make progress in both respects by training and analyzing transformer-based humor recognition models on a recently introduced dataset consisting of minimal pairs of aligned sentences, one serious, the other humorous. We find that, although our aligned dataset is much harder than previous datasets, transformer-based models recognize the humorous sentence in an aligned pair with high accuracy (78\%). In a careful error analysis, we characterize easy vs hard instances. Finally, by analyzing attention weights, we obtain important insights into the mechanisms by which transformers recognize humor. Most remarkably, we find clear evidence that one single attention head learns to recognize the words that make a test sentence humorous, even without access to this information at training time.

Download Full-text

BENCHMARKING PERFORMANCE OF BRAZILIAN CONTAINER TERMINALS AND THE REST OF THE WORLD

Revista Eletrônica de Estratégia & Negócios ◽

10.19177/reen.v14ei2021161-182 ◽

2021 ◽

Vol 14 (I) ◽

pp. 161

Author(s):

Eduardo Greco

Keyword(s):

Container Terminals ◽

The World ◽

Benchmarking Performance

Download Full-text

Conventional, High-Resolution and Imaging Flow Cytometry: Benchmarking Performance in Characterisation of Extracellular Vesicles

Biomedicines ◽

10.3390/biomedicines9020124 ◽

2021 ◽

Vol 9 (2) ◽

pp. 124

Author(s):

Jaco Botha ◽

Haley R. Pugsley ◽

Aase Handberg

Keyword(s):

Flow Cytometry ◽

High Resolution ◽

Extracellular Vesicles ◽

Single Particles ◽

Multiple Parameters ◽

Imaging Flow Cytometry ◽

Highly Sensitive ◽

Individual Project ◽

Benchmarking Performance ◽

High Throughput Manner

Flow cytometry remains a commonly used methodology due to its ability to characterise multiple parameters on single particles in a high-throughput manner. In order to address limitations with lacking sensitivity of conventional flow cytometry to characterise extracellular vesicles (EVs), novel, highly sensitive platforms, such as high-resolution and imaging flow cytometers, have been developed. We provided comparative benchmarks of a conventional FACS Aria III, a high-resolution Apogee A60 Micro-PLUS and the ImageStream X Mk II imaging flow cytometry platform. Nanospheres were used to systematically characterise the abilities of each platform to detect and quantify populations with different sizes, refractive indices and fluorescence properties, and the repeatability in concentration determinations was reported for each population. We evaluated the ability of the three platforms to detect different EV phenotypes in blood plasma and the intra-day, inter-day and global variabilities in determining EV concentrations. By applying this or similar methodology to characterise methods, researchers would be able to make informed decisions on choice of platforms and thereby be able to match suitable flow cytometry platforms with projects based on the needs of each individual project. This would greatly contribute to improving the robustness and reproducibility of EV studies.

Download Full-text

Benchmarking performance of RaySGD and Horovod for big data applications

2020 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata50022.2020.9378470 ◽

2020 ◽

Author(s):

Shruti Kunde ◽

Amey Pandit ◽

Rekha Singhal

Keyword(s):

Big Data ◽

Big Data Applications ◽

Benchmarking Performance

Download Full-text

Benchmarking performance of different noise detection techniques on data stream clustering

Proceedings of the 10th Euro-American Conference on Telematics and Information Systems ◽

10.1145/3401895.3401898 ◽

2020 ◽

Author(s):

Sonia Jaramillo-Valbuena ◽

Eduardo Carrillo-Zambrano ◽

Edwin Romero-Cuero

Keyword(s):

Data Stream ◽

Noise Detection ◽

Detection Techniques ◽

Stream Clustering ◽

Data Stream Clustering ◽

Benchmarking Performance

Download Full-text

Benchmarking Performance in Pancreatic Surgery: a Systematic Review of Published Quality Metrics

Journal of Gastrointestinal Surgery ◽

10.1007/s11605-020-04827-9 ◽

2020 ◽

Author(s):

Cindy Ou ◽

Michaela Rektorysova ◽

Bushra Othman ◽

John A. Windsor ◽

Sanjay Pandanaboyana ◽

...

Keyword(s):

Systematic Review ◽

Pancreatic Surgery ◽

Quality Metrics ◽

Benchmarking Performance

Download Full-text

Benchmarking Performance of Erasure Codes for Linux Filesystem EXT4, XFS and BTRFS

Advances in Intelligent Systems and Computing - Progress in Advanced Computing and Intelligent Engineering ◽

10.1007/978-981-15-6584-7_32 ◽

2020 ◽

pp. 325-334

Author(s):

Shreya Bokare ◽

Sanjay S. Pawar

Keyword(s):

Erasure Codes ◽

Benchmarking Performance

Download Full-text

benchmarking performance
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

The Role of Ligand Rebinding and Facilitated Dissociation on the Characterization of Dissociation Rates by Surface Plasmon Resonance (SPR) and Benchmarking Performance Metrics

Hospital-specific Template Matching for Benchmarking Performance in a Diverse Multihospital System

Deep saliency models learn low-, mid-, and high-level features to predict scene attention

Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?

BENCHMARKING PERFORMANCE OF BRAZILIAN CONTAINER TERMINALS AND THE REST OF THE WORLD

Conventional, High-Resolution and Imaging Flow Cytometry: Benchmarking Performance in Characterisation of Extracellular Vesicles

Benchmarking performance of RaySGD and Horovod for big data applications

Benchmarking performance of different noise detection techniques on data stream clustering

Benchmarking Performance in Pancreatic Surgery: a Systematic Review of Published Quality Metrics

Benchmarking Performance of Erasure Codes for Linux Filesystem EXT4, XFS and BTRFS

Export Citation Format

benchmarking performanceRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

The Role of Ligand Rebinding and Facilitated Dissociation on the Characterization of Dissociation Rates by Surface Plasmon Resonance (SPR) and Benchmarking Performance Metrics

Hospital-specific Template Matching for Benchmarking Performance in a Diverse Multihospital System

Deep saliency models learn low-, mid-, and high-level features to predict scene attention

Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?

BENCHMARKING PERFORMANCE OF BRAZILIAN CONTAINER TERMINALS AND THE REST OF THE WORLD

Conventional, High-Resolution and Imaging Flow Cytometry: Benchmarking Performance in Characterisation of Extracellular Vesicles

Benchmarking performance of RaySGD and Horovod for big data applications

Benchmarking performance of different noise detection techniques on data stream clustering

Benchmarking Performance in Pancreatic Surgery: a Systematic Review of Published Quality Metrics

Benchmarking Performance of Erasure Codes for Linux Filesystem EXT4, XFS and BTRFS

benchmarking performance
Recently Published Documents