Proceedings Article10.1109/COMPCOMM.2017.8322867
Top-down sampling convolution network for face segmentation
Yisu Zhou
- 01 Dec 2017
7
TL;DR: The paper adopts two different convolution sampling paths: from large scale to small scale sampling (top-down) and small scale to large Scale sampling (bottom-up), and proposes the top-down sampling convolution neural network for face segmentation (TDNN).
read more
Abstract: The paper adopts two different convolution sampling paths: from large scale to small scale sampling (top-down) and small scale to large scale sampling (bottom-up), and propose the top-down sampling convolution neural network for face segmentation (TDNN). On the LFW and the Helen dataset, it is demonstrated about the advantage of face segmentation by TDNN. In addition, the shared weight is added to each convolution integral, we propose TDNN with shared weight (TDNNSW). On the Helen dataset, TDNNSW with shared weight further improves the accuracy of face segmentation. Since TDNN is trained end-to-end, our model has advantageous properties such as less parameters and more rapid calculation for face segmentation.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
3D Head Pose Estimation through Facial Features and Deep Convolutional Neural Networks
Khalil Khan,Jehad Ali,Kashif Ahmad,Asma Gul,Ghulam Sarwar,Sahib Khan,Qui Thanh Hoai Ta,Tae-Sun Chung,Muhammad Attique +8 more
Abstract: Face image analysis is one among several important cues in computer vision. Over the last five decades, methods for face analysis have received immense attention due to large scale applications in various face analysis tasks. Face parsing strongly benefits various human face image analysis tasks inducing face pose estimation. In this paper we propose a 3D head pose estimation framework developed through a prior end to end deep face parsing model. We have developed an end to end face parts segmentation framework through deep convolutional neural networks (DCNNs). For training a deep face parts parsing model, we label face images for seven different classes, including eyes, brows, nose, hair, mouth, skin, and back. We extract features from gray scale images by using DCNNs. We train a classifier using the extracted features. We use the probabilistic classification method to produce gray scale images in the form of probability maps for each dense semantic class. We use a next stage of DCNNs and extract features from grayscale images created as probability maps during the segmentation phase. We assess the performance of our newly proposed model on four standard head pose datasets, including Pointing’04, Annotated Facial Landmarks in the Wild (AFLW), Boston University (BU), and ICT-3DHP, obtaining superior results as compared to previous results.
13
•Posted Content
A Facial Feature Discovery Framework for Race Classification Using Deep Learning.
TL;DR: In this article, a deep convolutional neural network (DCNN) was used to construct a face segmentation model for race classification, and five salient facial features were extracted from the PMs of five classes and a new model was trained based on the DCNN.
4
CADA-GAN: Context-Aware GAN with Data Augmentation
TL;DR: This paper proposed CADA-GAN, a C ontext-a-ware GAN that allows optimal feature extraction, with added robustness from additional D ata-A ugmentation, adapted from the popular StyleGAN2-Ada model, with attention on augmentation and segmentation of the parent images.
Patent
Image processing method, generative adversarial network, electronic equipment and storage medium
Zhang Yiwei,Zhao Yuan,Shen Haifeng +2 more
- 11 Feb 2020
TL;DR: In this paper, a generative adversarial network (GAN) is used to reduce the processing complexity in the image enhancement process, which can improve the restoration capability of tiny textures.
References
Fully convolutional networks for semantic segmentation
Jonathan Long,Evan Shelhamer,Trevor Darrell +2 more
- 07 Jun 2015
TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
TL;DR: This work addresses the task of semantic image segmentation with Deep Learning and proposes atrous spatial pyramid pooling (ASPP), which is proposed to robustly segment objects at multiple scales, and improves the localization of object boundaries by combining methods from DCNNs and probabilistic graphical models.
18.8K
Pyramid Scene Parsing Network
Hengshuang Zhao,Jianping Shi,Xiaojuan Qi,Xiaogang Wang,Jiaya Jia +4 more
- 21 Jul 2017
TL;DR: This paper exploits the capability of global context information by different-region-based context aggregation through the pyramid pooling module together with the proposed pyramid scene parsing network (PSPNet) to produce good quality results on the scene parsing task.
•Posted Content
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
TL;DR: DeepLab as discussed by the authors proposes atrous spatial pyramid pooling (ASPP) to segment objects at multiple scales by probing an incoming convolutional feature layer with filters at multiple sampling rates and effective fields-of-views.
14.3K
Stacked Hourglass Networks for Human Pose Estimation
Alejandro Newell,Kaiyu Yang,Jia Deng +2 more
- 08 Oct 2016
TL;DR: This work introduces a novel convolutional network architecture for the task of human pose estimation that is described as a “stacked hourglass” network based on the successive steps of pooling and upsampling that are done to produce a final set of predictions.