New Results in End-to-end Image and Video Compression by Deep Learning

The available stereo matching algorithms produce large number of false positive matches or only produce a few true-positives across oblique stereo images with large baseline. This undesired result happens due to the complex perspective deformation and radiometric distortion across the images. To address this problem, we propose a novel affine invariant feature matching algorithm with subpixel accuracy based on an end-to-end convolutional neural network (CNN). In our method, we adopt and modify a Hessian affine network, which we refer to as IHesAffNet, to obtain affine invariant Hessian regions using deep learning framework. To improve the correlation between corresponding features, we introduce an empirical weighted loss function (EWLF) based on the negative samples using K nearest neighbors, and then generate deep learning-based descriptors with high discrimination that is realized with our multiple hard network structure (MTHardNets). Following this step, the conjugate features are produced by using the Euclidean distance ratio as the matching metric, and the accuracy of matches are optimized through the deep learning transform based least square matching (DLT-LSM). Finally, experiments on Large baseline oblique stereo images acquired by ground close-range and unmanned aerial vehicle (UAV) verify the effectiveness of the proposed approach, and comprehensive comparisons demonstrate that our matching algorithm outperforms the state-of-art methods in terms of accuracy, distribution and correct ratio. The main contributions of this article are: (i) our proposed MTHardNets can generate high quality descriptors; and (ii) the IHesAffNet can produce substantial affine invariant corresponding features with reliable transform parameters.

Download Full-text

Deep learning for end-to-end kidney cancer diagnosis on multi-phase abdominal computed tomography

npj Precision Oncology ◽

10.1038/s41698-021-00195-y ◽

2021 ◽

Vol 5 (1) ◽

Author(s):

Kwang-Hyun Uhm ◽

Seung-Won Jung ◽

Moon Hyung Choi ◽

Hong-Kyu Shin ◽

Jae-Ik Yoo ◽

...

Keyword(s):

Computed Tomography ◽

Deep Learning ◽

Kidney Cancer ◽

Renal Tumor ◽

Renal Tumors ◽

Abdominal Computed Tomography ◽

Wide Range ◽

End To End ◽

Multi Phase ◽

Histologic Subtypes

AbstractIn 2020, it is estimated that 73,750 kidney cancer cases were diagnosed, and 14,830 people died from cancer in the United States. Preoperative multi-phase abdominal computed tomography (CT) is often used for detecting lesions and classifying histologic subtypes of renal tumor to avoid unnecessary biopsy or surgery. However, there exists inter-observer variability due to subtle differences in the imaging features of tumor subtypes, which makes decisions on treatment challenging. While deep learning has been recently applied to the automated diagnosis of renal tumor, classification of a wide range of subtype classes has not been sufficiently studied yet. In this paper, we propose an end-to-end deep learning model for the differential diagnosis of five major histologic subtypes of renal tumors including both benign and malignant tumors on multi-phase CT. Our model is a unified framework to simultaneously identify lesions and classify subtypes for the diagnosis without manual intervention. We trained and tested the model using CT data from 308 patients who underwent nephrectomy for renal tumors. The model achieved an area under the curve (AUC) of 0.889, and outperformed radiologists for most subtypes. We further validated the model on an independent dataset of 184 patients from The Cancer Imaging Archive (TCIA). The AUC for this dataset was 0.855, and the model performed comparably to the radiologists. These results indicate that our model can achieve similar or better diagnostic performance than radiologists in differentiating a wide range of renal tumors on multi-phase CT.

Download Full-text

End-to-End Prediction of Parcel Delivery Time with Deep Learning for Smart-City Applications

IEEE Internet of Things Journal ◽

10.1109/jiot.2021.3077007 ◽

2021 ◽

pp. 1-1

Author(s):

Arthur Cruz de Araujo ◽

Ali Etemad

Keyword(s):

Deep Learning ◽

Smart City ◽

Delivery Time ◽

End To End

Download Full-text

End-to-End, Pixel-Wise Vessel-Specific Coronary and Aortic Calcium Detection and Scoring Using Deep Learning

Diagnostics ◽

10.3390/diagnostics11020215 ◽

2021 ◽

Vol 11 (2) ◽

pp. 215

Author(s):

Gurpreet Singh ◽

Subhi Al’Aref ◽

Benjamin Lee ◽

Jing Lee ◽

Swee Tan ◽

...

Keyword(s):

Deep Learning ◽

Coronary Artery ◽

Clinical Importance ◽

Learning Model ◽

Mean Difference ◽

Interquartile Range ◽

History Of ◽

End To End ◽

Artery Disease ◽

Deep Learning Model

Conventional scoring and identification methods for coronary artery calcium (CAC) and aortic calcium (AC) result in information loss from the original image and can be time-consuming. In this study, we sought to demonstrate an end-to-end deep learning model as an alternative to the conventional methods. Scans of 377 patients with no history of coronary artery disease (CAD) were obtained and annotated. A deep learning model was trained, tested and validated in a 60:20:20 split. Within the cohort, mean age was 64.2 ± 9.8 years, and 33% were female. Left anterior descending, right coronary artery, left circumflex, triple vessel, and aortic calcifications were present in 74.87%, 55.82%, 57.41%, 46.03%, and 85.41% of patients respectively. An overall Dice score of 0.952 (interquartile range 0.921, 0.981) was achieved. Stratified by subgroups, there was no difference between male (0.948, interquartile range 0.920, 0.981) and female (0.965, interquartile range 0.933, 0.980) patients (p = 0.350), or, between age <65 (0.950, interquartile range 0.913, 0.981) and age ≥65 (0.957, interquartile range 0.930, 0.9778) (p = 0.742). There was good correlation and agreement for CAC prediction (rho = 0.876, p < 0.001), with a mean difference of 11.2% (p = 0.100). AC correlated well (rho = 0.947, p < 0.001), with a mean difference of 9% (p = 0.070). Automated segmentation took approximately 4 s per patient. Taken together, the deep-end learning model was able to robustly identify vessel-specific CAC and AC with high accuracy, and predict Agatston scores that correlated well with manual annotation, facilitating application into areas of research and clinical importance.

Download Full-text

Learned image and video compression with deep neural networks

2020 IEEE International Conference on Visual Communications and Image Processing (VCIP) ◽

10.1109/vcip49819.2020.9301828 ◽

2020 ◽

Author(s):

Dong Xu ◽

Guo Lu ◽

Ren Yang ◽

Radu Timofte

Keyword(s):

Neural Networks ◽

Video Compression ◽

Deep Neural Networks ◽

Image And Video Compression

Download Full-text

A note on the extension of a family of biorthogonal Coifman wavelet systems

The ANZIAM Journal ◽

10.1017/s1446181100013717 ◽

2004 ◽

Vol 46 (1) ◽

pp. 111-120 ◽

Cited By ~ 5

Author(s):

Zhuhan Jiang ◽

Xiling Guo

Keyword(s):

Video Compression ◽

Vanishing Moments ◽

Coifman Wavelets ◽

Image And Video Compression

AbstractWavelet systems with a maximum number of balanced vanishing moments are known to be extremely useful in a variety of applications such as image and video compression. Tian and Wells recently created a family of such wavelet systems, called the biorthogonal Coifman wavelets, which have proved valuable in both mathematics and applications. The purpose of this work is to establish along with direct proofs a very neat extension of Tian and Wells' family of biorthogonal Coifman wavelets by recovering other “missing” members of the biorthogonal Coifman wavelet systems.

Download Full-text