EyeTrackUAV2: A Large-Scale Binocular Eye-Tracking Dataset for UAV Videos

Drones ◽  
2020 ◽  
Vol 4 (1) ◽  
pp. 2 ◽  
Author(s):  
Anne-Flore Perrin ◽  
Vassilios Krassanakis ◽  
Lu Zhang ◽  
Vincent Ricordel ◽  
Matthieu Perreira Da Silva ◽  
...  

The rapid evolution of unmanned aerial vehicle (UAV) imagery has multiplied its applications in fields such as military and civilian surveillance, delivery services, and wildlife monitoring. Combining UAV imagery with the study of dynamic salience further extends the range of future applications: considerations of visual attention open new avenues in areas such as compression, retargeting, and decision-making tools. To support such saliency studies, we identified the need for new large-scale eye-tracking datasets for visual salience in UAV content, and we address it by introducing the EyeTrackUAV2 dataset. It consists of precise binocular gaze information (1000 Hz) collected over 43 videos (RGB, 30 fps, 1280 × 720 or 720 × 480). Thirty participants observed the stimuli under both free-viewing and task conditions. Fixations and saccades were then computed with the dispersion-threshold identification (I-DT) algorithm, while gaze density maps were calculated by filtering eye positions with a Gaussian kernel. An analysis of the collected gaze positions provides recommendations for visual-salience ground-truth generation. It also sheds light on how saliency biases in UAV videos differ from those in conventional content, especially regarding the center bias.
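
The two processing steps named in the abstract, I-DT fixation detection and Gaussian-filtered gaze density maps, are standard enough to sketch. The following Python sketch uses assumed parameter values (dispersion threshold, minimum duration, kernel sigma), not the paper's actual settings:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def idt_fixations(x, y, t, disp_thresh=25.0, min_dur=0.1):
    # Dispersion-threshold identification (I-DT): grow a window to the
    # minimum duration, keep it if its dispersion (x-range + y-range) is
    # below the threshold, then extend it while dispersion stays below.
    # x, y, t are NumPy arrays; disp_thresh (px) and min_dur (s) are
    # assumed values.
    fixations, i, n = [], 0, len(t)
    while i < n:
        j = i
        while j < n and t[j] - t[i] < min_dur:
            j += 1
        if j >= n:
            break
        if np.ptp(x[i:j+1]) + np.ptp(y[i:j+1]) <= disp_thresh:
            while j + 1 < n and np.ptp(x[i:j+2]) + np.ptp(y[i:j+2]) <= disp_thresh:
                j += 1
            fixations.append((i, j))   # fixation = sample index range
            i = j + 1
        else:
            i += 1
    return fixations

def gaze_density_map(xs, ys, width=1280, height=720, sigma=30.0):
    # Accumulate gaze positions into a 2D histogram and blur it with a
    # Gaussian kernel; sigma (px) is an assumed value.
    m = np.zeros((height, width))
    for px, py in zip(xs.astype(int), ys.astype(int)):
        if 0 <= py < height and 0 <= px < width:
            m[py, px] += 1
    m = gaussian_filter(m, sigma)
    return m / m.max() if m.max() > 0 else m
```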

Author(s):  
Hui Lin ◽  
Xiaopeng Hong ◽  
Zhiheng Ma ◽  
Xing Wei ◽  
Yunfeng Qiu ◽  
...  

Traditional crowd counting approaches usually rely on a Gaussian assumption to generate pseudo density ground truth, which suffers from problems such as inaccurate estimation of the Gaussian kernel sizes. In this paper, we propose a new measure-based counting approach that regresses the predicted density maps directly to the scattered point-annotated ground truth. First, crowd counting is formulated as a measure matching problem. Second, we derive a semi-balanced form of Sinkhorn divergence, based on which a Sinkhorn counting loss is designed for measure matching. Third, we propose a self-supervised mechanism, devising a Sinkhorn scale consistency loss to resist scale changes. Finally, an efficient optimization method is provided to minimize the overall loss function. Extensive experiments on four challenging crowd counting datasets, namely ShanghaiTech, UCF-QNRF, JHU++ and NWPU, have validated the proposed method.
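
The semi-balanced Sinkhorn divergence derived in the paper is specific to it, but the underlying measure-matching idea can be illustrated with a standard balanced Sinkhorn iteration between a predicted density map and point annotations. A minimal PyTorch sketch, with assumed values for the entropic regularization `eps` and the iteration count; this is not the paper's loss:

```python
import torch

def sinkhorn_cost(pred_density, coords, points, eps=0.1, n_iters=50):
    # pred_density: (H*W,) predicted density over pixel centers coords (H*W, 2)
    # points: (N, 2) annotated head positions.
    a = pred_density / pred_density.sum()                       # source measure
    b = torch.full((points.shape[0],), 1.0 / points.shape[0])   # target measure
    C = torch.cdist(coords, points) ** 2                        # squared Euclidean cost
    K = torch.exp(-C / eps)                                     # Gibbs kernel
    u, v = torch.ones_like(a), torch.ones_like(b)
    for _ in range(n_iters):                                    # Sinkhorn fixed point
        u = a / (K @ v + 1e-9)
        v = b / (K.T @ u + 1e-9)
    P = u[:, None] * K * v[None, :]                             # transport plan
    return (P * C).sum()                                        # entropic OT cost
```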


Author(s):  
A. V. Ponomarev

Introduction: Large-scale human-computer systems that involve people of varying skill and motivation in information processing are currently used in a wide spectrum of applications. An acute problem in such systems is assessing the expected quality of each contributor, for example, in order to penalize incompetent or inaccurate contributors and to promote diligent ones. Purpose: To develop a method of assessing a contributor's expected quality in community tagging systems, using only the generally unreliable and incomplete information provided by contributors (with ground-truth tags unknown). Results: A mathematical model is proposed for community image tagging (including a model of a contributor), along with a method of assessing a contributor's expected quality. The method compares the tag sets provided by different contributors for the same images; it is a modification of the pairwise comparison method, with the preference relation replaced by a special domination characteristic. Contributors' expected quality is evaluated as a positive eigenvector of a pairwise domination-characteristic matrix. Community tagging simulations have confirmed that the proposed method adequately estimates the expected quality of contributors in a community tagging system (provided that the contributors' behavior fits the proposed model). Practical relevance: The obtained results can be used in the development of systems based on the coordinated efforts of a community (primarily, community tagging systems).
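
The eigenvector computation at the core of the method can be sketched with power iteration; the construction of the domination-characteristic matrix itself follows the paper's model and is assumed here as an input:

```python
import numpy as np

def expected_quality(D, n_iters=100, tol=1e-9):
    # Power iteration for the positive (Perron) eigenvector of a nonnegative
    # pairwise domination-characteristic matrix D, where D[i, j] is assumed
    # to hold the domination characteristic of contributor i over j.
    q = np.ones(D.shape[0]) / D.shape[0]
    for _ in range(n_iters):
        q_next = D @ q
        q_next /= q_next.sum()          # normalize so scores sum to 1
        if np.abs(q_next - q).max() < tol:
            break
        q = q_next
    return q                            # higher score = higher expected quality
```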


1996 ◽  
Vol 168 ◽  
pp. 175-182 ◽  
Author(s):  
D.S. Mathewson ◽  
V.L. Ford

Peculiar velocity measurements of 2500 southern spiral galaxies show large-scale flows in the direction of the Hydra-Centaurus clusters, which fully participate in the flow themselves. The flow is not uniform over this region and seems to be associated with the denser regions, which participate in a flow of amplitude about 400 km/s. In the less dense regions the flow is small or non-existent. This makes the flow quite asymmetric and inconsistent with that expected from large-scale, parallel streaming flow that includes all galaxies out to 6000 km/s, as previously thought. The flow cannot be modelled by a Great Attractor at 4300 km/s or the Centaurus clusters at 3500 km/s. Indeed, from the density maps derived from the redshift surveys of “optical” and IRAS galaxies, it is difficult to see how these mass concentrations can be responsible, particularly as they themselves participate in the flow. These results call into question the generally accepted explanation for the peculiar velocities of galaxies, namely that they arise solely as a consequence of infall into the dense regions of the universe. To the north of the Great Attractor region, the flow increases and shows no sign of diminishing out to the redshift limit of 8000 km/s in this direction. We may have detected flow in the nearest section of the Great Wall.


2021 ◽  
Vol 7 (2) ◽  
pp. 21
Author(s):  
Roland Perko ◽  
Manfred Klopschitz ◽  
Alexander Almer ◽  
Peter M. Roth

Many scientific studies deal with person counting and density estimation from single images, and convolutional neural networks (CNNs) have recently been applied to these tasks. Even though better results are often reported, it is frequently unclear where the improvements come from and whether the proposed approaches would generalize. The main goal of this paper was therefore to identify the critical aspects of these tasks and to show how they limit state-of-the-art approaches. Based on these findings, we show how to mitigate these limitations. To this end, we implemented a CNN-based baseline approach, which we extended to deal with the identified problems: bias in the reference data sets, ambiguity in ground-truth generation, and a mismatch between the evaluation metrics and the training loss function. The experimental results show that our modifications significantly outperform the baseline in terms of the accuracy of person counts and density estimation. In this way, we gain a deeper understanding of CNN-based person density estimation beyond the network architecture. Furthermore, our insights can help advance the field of person density estimation in general by highlighting current limitations in the evaluation protocols.
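
One of the identified problems, the mismatch between the training loss and the evaluation metrics, is easy to make concrete: density-map regression is typically trained with a per-pixel loss, while evaluation uses count-level MAE/RMSE. A small illustrative sketch (the names are ours, not the paper's):

```python
import numpy as np

def count_metrics(pred_maps, gt_counts):
    # A person count is the integral (sum) of each predicted density map;
    # MAE and RMSE over counts are the standard evaluation metrics. A
    # per-pixel MSE training loss on the maps does not directly optimize
    # these count-level metrics, which is one source of the mismatch.
    pred_counts = np.array([m.sum() for m in pred_maps])
    err = pred_counts - np.asarray(gt_counts, dtype=float)
    mae = np.abs(err).mean()
    rmse = np.sqrt((err ** 2).mean())
    return mae, rmse
```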


2019 ◽  
Vol 53 (1) ◽  
pp. 38-39
Author(s):  
Anjie Fang

Recently, political events such as elections have generated extensive discussion on social media networks, in particular Twitter. This brings new opportunities for social scientists to address social science tasks, such as understanding what communities said or identifying whether one community influences another. However, identifying these communities and extracting what they said from social media data are challenging, non-trivial tasks. We aim to make progress towards understanding 'who' (i.e. communities) said 'what' (i.e. discussed topics) and 'when' (i.e. time) during political events on Twitter. While identifying the 'who' can benefit from Twitter user community classification approaches, 'what' they said and 'when' can be effectively addressed by extracting their discussed topics using topic modelling approaches that also account for the importance of time on Twitter. To evaluate the quality of these topics, it is necessary to investigate how coherent they are to humans. Accordingly, we propose a series of approaches in this thesis.

First, we investigate how to effectively evaluate the coherence of topics generated using a topic modelling approach. A topic coherence metric evaluates topical coherence by examining the semantic similarity among the words in a topic. We argue that the semantic similarity of words in tweets can be effectively captured by word embeddings trained on a Twitter background dataset. Through a user study, we demonstrate that our proposed word embedding-based topic coherence metric assesses the coherence of topics as humans do [1, 2]. In addition, inspired by the precision at k metric, we propose to evaluate the coherence of a topic model (containing many topics) by averaging the coherence of its top-ranked topics [3]. Our proposed metrics not only evaluate the coherence of topics and topic models, but also help users choose the most coherent topics.

Second, we aim to extract topics with high coherence from Twitter data. Such topics can be easily interpreted by humans, and they help examine 'what' has been discussed and 'when'. Indeed, we argue that topics can be discussed in different time periods (see [4]) and can therefore be effectively identified and distinguished by considering those periods. Hence, we propose an effective time-sensitive topic modelling approach that integrates the time dimension of tweets (i.e. 'when') [5]. We show that the time dimension helps to generate topics with high coherence; hence, 'what' has been discussed and 'when' can be effectively addressed by our proposed time-sensitive topic modelling approach.

Next, to identify 'who' participated in the topic discussions, we propose approaches to identify the community affiliations of Twitter users, including automatic ground-truth generation approaches and a user community classification approach. We show that the hashtags and entities mentioned in users' tweets can indicate which community a Twitter user belongs to, and argue that they can be used to generate ground-truth data for classifying users into communities. On the other hand, we argue that different communities favour different topic discussions, so their community affiliations can be identified by leveraging the discussed topics. Accordingly, we propose a Topic-Based Naive Bayes (TBNB) classification approach that classifies Twitter users based on their words and discussed topics [6]. We demonstrate that our TBNB classifier, together with the ground-truth generation approaches, can effectively identify the community affiliations of Twitter users.

Finally, to show the generalisation of our approaches, we apply them to analyse 3.6 million tweets related to US Election 2016 [7]. We show that our TBNB approach can effectively identify the 'who', i.e. classify Twitter users into communities. To investigate 'what' these communities discussed, we apply our time-sensitive topic modelling approach to extract coherent topics, and we analyse the community-related topics evaluated and selected using our proposed topic coherence metrics. Overall, we contribute effective approaches that assist social scientists in analysing political events on Twitter: topic coherence metrics, a time-sensitive topic modelling approach, and approaches for classifying the community affiliations of Twitter users. Together, they make progress towards studying and understanding the connections and dynamics among communities on Twitter.

Supervisors: Iadh Ounis, Craig Macdonald, Philip Habel

The thesis is available at http://theses.gla.ac.uk/41135/
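
The word-embedding-based coherence idea is simple to sketch: score a topic by the average pairwise cosine similarity of its top words' embeddings, and score a topic model by averaging its top-k topics, echoing precision at k. A minimal sketch under assumed interfaces; the thesis's exact formulation may differ:

```python
import numpy as np

def embedding_coherence(top_words, embeddings):
    # Average pairwise cosine similarity of a topic's top words.
    # `embeddings` is assumed to map a word to its vector, e.g. trained
    # on a Twitter background corpus.
    vecs = [embeddings[w] for w in top_words if w in embeddings]
    sims = []
    for i in range(len(vecs)):
        for j in range(i + 1, len(vecs)):
            a, b = vecs[i], vecs[j]
            sims.append(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
    return float(np.mean(sims)) if sims else 0.0

def model_coherence(topics, embeddings, k=10):
    # Average the k highest topic coherences, mirroring precision at k.
    scores = sorted((embedding_coherence(t, embeddings) for t in topics),
                    reverse=True)
    return float(np.mean(scores[:k]))
```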


2013 ◽  
Vol 31 (12) ◽  
pp. 921-934 ◽  
Author(s):  
Ruan Lakemond ◽  
Clinton Fookes ◽  
Sridha Sridharan

Author(s):  
Sharon E. Nicholson ◽  
Douglas Klotter ◽  
Adam T. Hartman

This article examined rainfall enhancement over Lake Victoria. Estimates of over-lake rainfall were compared with rainfall in the surrounding lake catchment. Four satellite products were initially tested against estimates based on gauges or water balance models: TRMM 3B43, IMERG V06 Final Run (IMERG-F), CHIRPS2, and PERSIANN-CDR. There was agreement among the satellite products for catchment rainfall but a large disparity among them for over-lake rainfall. IMERG-F was clearly an outlier, exceeding the estimate from TRMM 3B43 by 36%. The overestimation by IMERG-F was likely related to passive microwave assessments of strong convection, such as prevails over Lake Victoria. Overall, TRMM 3B43 showed the best agreement with the “ground truth” and was used in further analyses. Over-lake rainfall was found to be enhanced compared to catchment rainfall in all months. During the March-to-May long rains the enhancement varied between 40% and 50%. During the October-to-December short rains the enhancement varied between 33% and 44%. Even during the two dry seasons the enhancement was at least 20%, and over 50% in some months. While the magnitude of enhancement varied from month to month, the seasonal cycle was essentially the same for over-lake and catchment rainfall, suggesting that the dominant influence on over-lake rainfall is the large-scale environment. The association with Mesoscale Convective Systems (MCSs) was also evaluated. The similarity of the spatial patterns of rainfall and MCS count each month suggested that these systems produced a major share of rainfall over the lake. Similarity in interannual variability further supported this conclusion.
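
For reference, the enhancement percentages quoted above follow the usual relative-difference definition; a minimal sketch:

```python
def rainfall_enhancement_pct(over_lake_mm, catchment_mm):
    # Percent by which over-lake rainfall exceeds catchment rainfall,
    # e.g. 150 mm over-lake vs. 100 mm catchment gives 50% enhancement.
    return 100.0 * (over_lake_mm - catchment_mm) / catchment_mm
```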


Author(s):  
Maggie Hess

Purpose: Intraventricular hemorrhage (IVH) affects nearly 15% of preterm infants and can lead to ventricular dilation and cognitive impairment. MR-guided focused ultrasound surgery (MRgFUS) is being investigated to ablate IVH clots; this procedure requires accurate, fast, and consistent quantification of ventricle and clot volumes. Methods: We developed a semi-autonomous segmentation (SAS) algorithm for measuring changes in ventricle and clot volumes. Images are normalized, and ventricle and clot masks are registered to them. Voxels of the registered masks, together with voxels obtained by thresholding the normalized images, serve as seed points for competitive region growing, which produces the final segmentation; after thresholding, the user selects the areas of interest, and these selections become the final seeds for region growing. SAS was evaluated on an IVH porcine model. Results: SAS was compared to ground-truth manual segmentation (MS) for accuracy, efficiency, and consistency. Accuracy was determined by comparing the clot and ventricle volumes produced by SAS and MS; in a two one-sided test, SAS and MS were found to be statistically equivalent (p < 0.01). On average, SAS was 15 times faster than MS (p < 0.01). Consistency was determined by repeated segmentation of the same image with both SAS and the manual method, with SAS significantly more consistent than MS (p < 0.05). Conclusion: SAS is a viable method to quantify the IVH clot and the lateral brain ventricles, and it is being used in a large-scale porcine study of MRgFUS treatment of IVH clot lysis.
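
The competitive region-growing step can be sketched as a breadth-first expansion from labeled seeds, with each voxel claimed by the first region that reaches it at a similar intensity. A simplified Python sketch; the tolerance, seed format, and 6-connectivity are assumed choices, not necessarily those of the study:

```python
import numpy as np
from collections import deque

def competitive_region_growing(img, seeds, tol=0.1):
    # img: normalized 3D volume; seeds: iterable of ((z, y, x), label),
    # e.g. label 1 = ventricle, label 2 = clot. Regions expand breadth-first
    # and claim each unlabeled voxel whose intensity is within `tol` of the
    # voxel it was reached from.
    labels = np.zeros(img.shape, dtype=int)
    q = deque()
    for (z, y, x), lab in seeds:
        labels[z, y, x] = lab
        q.append((z, y, x))
    offsets = [(1,0,0), (-1,0,0), (0,1,0), (0,-1,0), (0,0,1), (0,0,-1)]
    while q:
        z, y, x = q.popleft()
        lab, ref = labels[z, y, x], img[z, y, x]
        for dz, dy, dx in offsets:                 # 6-connected neighbors
            nz, ny, nx = z + dz, y + dy, x + dx
            if (0 <= nz < img.shape[0] and 0 <= ny < img.shape[1]
                    and 0 <= nx < img.shape[2] and labels[nz, ny, nx] == 0
                    and abs(img[nz, ny, nx] - ref) <= tol):
                labels[nz, ny, nx] = lab
                q.append((nz, ny, nx))
    return labels
```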

