Motion Segmentation and Matting by Graph Cut

Author(s):  
Jiangjian Xiao

Given a video sequence, accurate layer segmentation and alpha matting are important for video representation, analysis, compression, and synthesis. Assuming that a scene can be approximately described by multiple planar or surface regions, this chapter describes a robust approach that automatically detects the region clusters and performs accurate layer segmentation of the scene. The approach starts from an optical flow field or small corresponding seed regions and applies clustering to estimate the number of layers and their support regions. It then uses a graph cut algorithm, combined with a general occlusion constraint, to solve the pixel assignment over multiple frames, which yields a more accurate segmentation boundary and identifies the occluded pixels. For non-textured, ambiguous regions, an alpha matting technique further refines the segmentation and resolves the ambiguities by determining proper alpha values for the foreground and background. Based on the alpha mattes, the foreground object can be transferred into another video sequence to generate a virtual video. The author’s experiments show that the proposed approach is effective and robust on challenging real and synthetic sequences.
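The transfer of a foreground object into another sequence can be illustrated with the standard compositing equation C = alpha * F + (1 - alpha) * B, applied per pixel. The following is only a minimal sketch on a single grayscale row; the function name `composite` and the sample values are hypothetical and are not taken from the chapter.

```python
def composite(alpha, foreground, background):
    """Blend per pixel: C = alpha * F + (1 - alpha) * B."""
    return [
        [a * f + (1.0 - a) * b for a, f, b in zip(row_a, row_f, row_b)]
        for row_a, row_f, row_b in zip(alpha, foreground, background)
    ]


# Hypothetical 1x3 grayscale example: opaque, half-transparent, and
# fully transparent foreground pixels over a new background.
alpha = [[1.0, 0.5, 0.0]]
foreground = [[200.0, 200.0, 200.0]]
new_background = [[100.0, 100.0, 100.0]]

result = composite(alpha, foreground, new_background)
print(result)  # [[200.0, 150.0, 100.0]]
```

Pixels with alpha strictly between 0 and 1 are the ambiguous boundary pixels that a hard segmentation alone cannot represent.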

2019 ◽  
Vol 63 (5) ◽  
pp. 50401-1-50401-7 ◽  
Author(s):  
Jing Chen ◽  
Jie Liao ◽  
Huanqiang Zeng ◽  
Canhui Cai ◽  
Kai-Kuang Ma

For robust three-dimensional video transmission over error-prone channels, this article proposes an efficient multiple description coding scheme for multi-view video based on the correlation of spatial polyphase transformed subsequences (CSPT_MDC_MVC). The input multi-view video sequence is first separated into four subsequences by a spatial polyphase transform, and these are then grouped into two descriptions. Because macroblocks at corresponding positions in the subsequences are correlated, the subsequences should not all be coded in exactly the same way. In each description, one subsequence is coded directly by the Joint Multi-view Video Coding (JMVC) encoder and the other subsequence is classified into four sets. According to the classification, the indirectly coded subsequence selectively reuses the prediction mode and the prediction vector of its directly coded counterpart, which reduces the bitrate and the coding complexity of multiple description coding for multi-view video. On the decoder side, gradient-based directional interpolation is employed to improve the side reconstruction quality. The effectiveness and robustness of the proposed algorithm are verified by experiments on the JMVC coding platform.
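The separation into four subsequences can be sketched as subsampling each frame by row and column parity. This is a minimal illustration under assumptions: the abstract does not say which subsequences are paired into each description, so the diagonal pairing below is hypothetical, as are the names `polyphase_split` and `polyphase_merge`.

```python
def polyphase_split(frame):
    """Split a 2-D frame into four subframes keyed by (row parity, column parity)."""
    return {
        (r, c): [row[c::2] for row in frame[r::2]]
        for r in (0, 1)
        for c in (0, 1)
    }


def polyphase_merge(subs, height, width):
    """Inverse of polyphase_split: interleave the four subframes again."""
    out = [[None] * width for _ in range(height)]
    for (r, c), sub in subs.items():
        for i, row in enumerate(sub):
            for j, value in enumerate(row):
                out[r + 2 * i][c + 2 * j] = value
    return out


# Hypothetical 4x4 frame with pixel values 0..15.
frame = [[y * 4 + x for x in range(4)] for y in range(4)]
subs = polyphase_split(frame)

# Assumed grouping (not stated in the abstract): diagonal pairs.
description_1 = [subs[(0, 0)], subs[(1, 1)]]
description_2 = [subs[(0, 1)], subs[(1, 0)]]

print(subs[(0, 0)])  # [[0, 2], [8, 10]]
```

If only one description arrives, the decoder holds half of the pixels and must interpolate the rest, which is where the interpolation step on the decoder side comes in.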


2019 ◽  
Vol 29 (08) ◽  
pp. 2050133
Author(s):  
Anas Fouad Ahmed ◽  
Mohammed Abdulmunem Ahmed ◽  
Hussain Mustafa Bierk

This paper introduces an efficient and robust method for heartbeat detection based on the angles calculated between successive samples of the electrocardiogram (ECG) signal. The proposed approach involves three stages: filtering, computing the angles of the signal, and thresholding. The method is applied to two different ECG databases (QTDB and MIT-BIH), and the results are compared with algorithms from previous works. The proposed approach outperformed the other algorithms despite its simplicity and fast calculations. These features make it applicable to real-time ECG diagnostic systems. The method was also implemented in real time using a low-cost ECG acquisition system, where it showed excellent performance.
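The angle and thresholding stages might look roughly like the sketch below. The abstract does not give the exact angle definition, so this assumes the angle at a sample is the one formed by the segments to its two neighbours; the filtering stage is omitted, and the time step `dt`, the threshold `max_angle_deg`, and the toy signal are all hypothetical.

```python
import math


def sample_angles(signal, dt=1.0):
    """Angle in degrees at each interior sample, formed by the segments
    to its previous and next neighbours (assumed definition)."""
    angles = []
    for i in range(1, len(signal) - 1):
        left = (-dt, signal[i - 1] - signal[i])
        right = (dt, signal[i + 1] - signal[i])
        dot = left[0] * right[0] + left[1] * right[1]
        norm = math.hypot(*left) * math.hypot(*right)
        cosine = max(-1.0, min(1.0, dot / norm))
        angles.append(math.degrees(math.acos(cosine)))
    return angles


def detect_sharp_points(signal, max_angle_deg=60.0, dt=1.0):
    """Indices whose angle falls below the threshold. A small angle marks
    a sharp turning point, which can be a peak or a trough."""
    return [
        i + 1
        for i, angle in enumerate(sample_angles(signal, dt))
        if angle < max_angle_deg
    ]


# Toy signal with one sharp spike at index 3.
signal = [0.0, 0.1, 0.0, 5.0, 0.0, 0.1, 0.0]
peaks = detect_sharp_points(signal)
print(peaks)  # [3]
```

The computed angle depends on the relative scaling of the time and amplitude axes, so in practice `dt` and the threshold would have to be tuned together.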


2002 ◽  
Vol 8 (2-3) ◽  
pp. 145-165 ◽  
Author(s):  
FABIO CIRAVEGNA ◽  
ALBERTO LAVELLI

This paper proposes a robust approach to parsing suitable for Information Extraction (IE) from texts using finite-state cascades. The approach is characterized by the construction of an approximation of the full parse tree that captures all the information relevant for IE purposes, leaving the other relations underspecified. Sequences of cascades of finite-state rules deterministically analyze the text, building unambiguous structures. Initially basic chunks are analyzed; then clauses are recognized and nested; finally modifier attachment is performed and the global parse tree is built. The parsing approach allows robust, effective and efficient analysis of real world texts. The grammar organization simplifies changes, insertion of new rules and integration of domain-oriented rules. The approach has been tested for Italian, English, and Russian. A parser based on such an approach has been implemented as part of Pinocchio, an environment for developing and running IE applications.


2019 ◽  
Vol 14 (2) ◽  
pp. 139-147 ◽  
Author(s):  
Zhang Yang ◽  
Xie Yingying ◽  
Guo Li ◽  
Zhang Zewei ◽  
Ding Weifeng ◽  
...  

Background: Lung cancer is one of the greatest threats to people's health and life. In CT images, nodule boundaries are often unclear, and boundary information is not well considered when segmenting irregular nodules and complex structures, so lung nodule segmentation has long been a hot topic. Objective: In this study, pulmonary nodule segmentation is accomplished with a new graph cut algorithm, and the problem of segmenting juxta-pleural and juxta-vascular nodules with graph cut is investigated. Methods: First, the inflection points are determined from the curvature. Second, kernel graph cut is used to segment the nodules and obtain an initial edge. Third, seed points are obtained with a ray-casting method. Lastly, a novel geodesic distance function is proposed to improve the graph cut algorithm and is applied to lung nodule segmentation. Results: The new algorithm was tested on a total of 258 nodules. Table 1 summarizes the morphologic features of all the nodules and gives the results for the successful segmentation group and the poor/failed segmentation group. Figures 1 to 12 show the segmentation of juxta-vascular and juxta-pleural nodules, compared with other interactive segmentation methods. Conclusion: The experimental verification shows better results with our algorithm, and the segmentation results can be used for numerical measurement of nodule volume. The lung nodule segmentation results in this study are as good as those obtained by the other methods.


Perception ◽  
1997 ◽  
Vol 26 (11) ◽  
pp. 1367-1380 ◽  
Author(s):  
Carol M Cicerone ◽  
Donald D Hoffman

‘Color from motion’ describes the perception of a spread of subjective color over achromatic regions seen as moving. The effect can be produced in a display of multiple frames shown in quick succession, each frame consisting of a fixed, random placement of colored dots on a high-luminance white background with color assignments of some dots, but not dot locations, changing from frame to frame. Evidence is presented that the perception of apparent motion and the spread of subjective color can be activated by binocular combination of disjoint signals to each eye. The dichoptic presentation of every odd-numbered frame of the full stimulus sequence presented to one eye and, out of phase, every even-numbered frame to the other eye produces a compelling perception of color from motion equal to that seen with the full sequence presented to each eye alone. This is consistent with the idea that color from motion is regulated in sites at or beyond the convergence of monocular pathways. When the background field in the stimulus display is of low luminance, an amodally complete object, fully colored and matching the dots defining the moving region in hue and saturation, is seen to move behind a partially occluding screen. Observers do not perceive such an object in still view. Hence, color from motion can be used by the visual system to produce amodal completion, which suggests that it may play a role in enhancing the visibility of camouflaged objects.


1998 ◽  
Vol 44 (3) ◽  
pp. 622-631 ◽  
Author(s):  
Paul S Horn ◽  
Amadeo J Pesce ◽  
Bradley E Copeland

We propose a new methodology for the estimation of reference intervals for data sets with small numbers of observations or for those with substantial numbers of outliers. We propose a prediction interval that uses robust estimates of location and scale. The SAS software can be readily modified to do these calculations. We compared four reference interval procedures (nonparametric, transformed, robust with a nonparametric lower limit, and transformed robust) for sample sizes of 20, 40, 60, 80, 100, and 120 from χ² distributions of 1, 4, 7, and 10 df. χ² distributions were chosen because they simulate the skewness of distributions often found in clinical chemistry populations. We used the root mean square error as the measure of performance and used computer simulation to calculate this measure. The robust estimator showed the best performance for small sample sizes. As the sample size increased, the performance values converged. The robust method for calculating upper reference interval values yields reasonable results. In two examples using real data for haptoglobin and glucose, the robust estimator provides slightly smaller upper reference limits than the other procedures. Lastly, the robust estimator was compared with the other procedures in a population where 5% of the values were multiplied by a factor of 5. The reference intervals were calculated with and without outlier detection. In this case, the robust approach consistently yielded upper reference interval values that were closer to those of the true underlying distributions. We propose that robust statistical analysis can be of great use for determinations of reference intervals from limited or possibly unreliable data.
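The idea of building an interval from robust location and scale estimates can be sketched as follows. The abstract does not name the estimators used, so this substitutes the median and the scaled median absolute deviation (MAD) as stand-ins, and the simple `center ± z * scale` form is a simplification rather than the paper's prediction interval. The sample values are made up.

```python
import statistics


def robust_interval(values, z=1.96):
    """Interval from median and scaled MAD (stand-in robust estimators)."""
    center = statistics.median(values)
    mad = statistics.median([abs(v - center) for v in values])
    scale = 1.4826 * mad  # makes MAD comparable to the SD for normal data
    return center - z * scale, center + z * scale


def classical_interval(values, z=1.96):
    """Interval from mean and sample standard deviation, for comparison."""
    center = statistics.mean(values)
    scale = statistics.stdev(values)
    return center - z * scale, center + z * scale


# Made-up sample with a single gross outlier (25.0).
values = [4.8, 5.0, 5.1, 5.2, 5.3, 5.4, 5.6, 25.0]

low, high = robust_interval(values)
classical_low, classical_high = classical_interval(values)
print(round(low, 2), round(high, 2))
print(round(classical_low, 2), round(classical_high, 2))
```

On this sample the outlier inflates the classical upper limit far beyond the bulk of the data, while the median and MAD version stays close to it.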


2012 ◽  
Vol 424-425 ◽  
pp. 151-154
Author(s):  
Shi Lin Zhang ◽  
He Ping Li ◽  
Shu Wu Zhang

Video object extraction is a critical task in multimedia analysis and editing. Normally, the user provides some hints of foreground and background, and then the target object is extracted from the video sequence. In this paper, we propose an object segmentation system that integrates a clustering model with Markov random field-based contour tracking and graph-cut image segmentation. The contour tracking propagates the shape of the target object, whereas the graph-cut refines the shape and improves the accuracy of video segmentation. Experimental results show that our segmentation system is efficient


2009 ◽  
Vol 8 (3) ◽  
pp. 45-50 ◽  
Author(s):  
Ya-lin Wu ◽  
Soon-kak Kwon

We propose a method for transcoding an H.264 coded bitstream that controls the picture quality depending on the region of interest. In the proposed method, we first find a model relating the quantization step-size to the bitrate. We then suggest a method for classifying the subjective regions of interest within a video sequence, and a method that assigns a specific quantization step-size to each region accordingly. In general, the subjective picture quality can be increased by applying a relatively small quantization step-size to the region of interest compared with the other regions. Simulations show that the proposed method yields relatively better subjective picture quality in the regions of interest.
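Assigning a smaller quantization step-size to the region of interest can be sketched with a per-macroblock quantization parameter (QP) map. In H.264 the step size roughly doubles for every increase of 6 in QP, and QP ranges from 0 to 51. The base QP, the offset, and the mask below are hypothetical values chosen for illustration; the paper's step-size and bitrate model is not reproduced here.

```python
def qstep(qp):
    """Approximate H.264 quantization step size (doubles every 6 QP)."""
    return 0.625 * 2 ** (qp / 6)


def assign_qp(roi_mask, base_qp=32, roi_offset=6):
    """Return a QP map: macroblocks inside the region of interest get a
    lower QP (finer quantization), clamped to the valid range 0..51."""
    return [
        [
            max(0, min(51, base_qp - roi_offset if in_roi else base_qp))
            for in_roi in row
        ]
        for row in roi_mask
    ]


# Hypothetical 2x2 macroblock grid with one region-of-interest block.
roi_mask = [[False, True], [False, False]]
qp_map = assign_qp(roi_mask)
print(qp_map)  # [[32, 26], [32, 32]]
```

With an offset of 6, the region-of-interest block is quantized with about half the step size of its neighbours, at the cost of more bits for that block.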

