PA 218/17
Computer scientists at the University of Nottingham and Kingston University have solved a complex problem that has, until now, defeated experts in vision and graphics research. They have developed technology capable of producing 3D facial reconstruction from a single 2D image - the 3D selfie.
Their new web app allows people to upload a single colour image and receive, in a few seconds, a 3D model showing the shape of their face. People are queuing up to try it and so far, more than 400,000 users have had a go. You can do it yourself by taking a selfie and uploading it to their website.
The research – 'Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression’ – was led by PhD student Aaron Jackson and carried out with fellow PhD student Adrian Bulat both based in the Computer Vision Laboratory in the School of Computer Science. Both students are supervised by Georgios (Yorgos) Tzimiropoulos, Assistant Professor in the School of Computer Science. The work was done in collaboration with Dr Vasileios Argyriou from the School of Computer Science and Mathematics at Kingston University.
The results will be presented at the International Conference on Computer Vision (ICCV) 2017 in Venice next month.
Technology at a very early stage
The technique is far from perfect but this is the breakthrough computer scientists have been looking for.
It has been developed using a Convolutional Neural Network (CNN) – an area of artificial intelligence (AI) which uses machine learning to give computers the ability to learn without being explicitly programmed.
The research team, supervised by Dr Yorgos Tzimiropoulos, trained a CNN on a huge dataset of 2D pictures and 3D facial models. With all this information their CNN is able to reconstruct 3D facial geometry from a single 2D image. It can also take a good guess at the non-visible parts of the face.
Simple idea complex problem
Dr Tzimiropoulos said: “The main novelty is in the simplicity of our approach which bypasses the complex pipelines typically used by other tecniques. We instead came up with the idea of training a big neural network on 80,000 faces to directly learn to output the 3D facial geometry from a single 2D image.”
This is a problem of extraordinary difficulty. Current systems require multiple facial images and face several challenges, such as dense correspondences across large facial poses, expressions and non-uniform illumination.
Aaron Jackson said: “Our CNN uses just a single 2D facial image, and works for arbitrary facial poses (for instance front or profile images) and facial expressions (for instance smiling).”
Adrian Bulat said “The method can be used to reconstruct the whole 3D facial geometry including the non-visible parts of the face.”
Their technique demonstrates some of the advances possible through deep learning – a form of machine learning that uses artificial neural networks to mimic the way the brain makes connections between pieces of information.
Dr Vasileios Argyriou, from Kingston University’s Faculty of Science, Engineering and Computing, said: “What’s really impressive about this technique is how it has made the process of creating a 3D facial model so simple.”
What could the applications be?
Aside from the more standard applications, such as face and emotion recognition, this technology could be used to personalise computer games, improve augmented reality, and let people try on online accessories such as glasses.
It could also have medical applications – such as simulating the results of plastic surgery or helping to understand medical conditions such as autism and depression.
Their video example can be seen here.
Code is available here.
Their online demo can be found here.
Aaron’s PhD is funded by the University of Nottingham. His research is focused on deep learning applied to the human face. This includes 3D reconstruction and segmentation applied to the human face and body.
Adrian Bulat is a PhD student in the Computer Vision Lab. His main research interests are in the area of face analysis, human pose estimation and neural network quantization/binarization.
— Ends —
Our academics can now be interviewed for broadcast via our Media Hub, which offers a Globelynx fixed camera and ISDN line facilities at University Park campus. For further information please contact a member of the Communications team on +44 (0)115 951 5798, email mediahub@nottingham.ac.uk or see the Globelynx website for how to register for this service.
For up to the minute media alerts, follow us on Twitter
Notes to editors:
The University of Nottingham is a research-intensive university with a proud heritage, consistently ranked among the world's top 100. Studying at the University of Nottingham is a life-changing experience and we pride ourselves on unlocking the potential of our 44,000 students - Nottingham was named University of the Year for Graduate Employment in the 2017 Times and Sunday Times Good University Guide, was awarded gold in the TEF 2017 and features in the top 20 of all three major UK rankings. We have a pioneering spirit, expressed in the vision of our founder Sir Jesse Boot, which has seen us lead the way in establishing campuses in China and Malaysia - part of a globally connected network of education, research and industrial engagement. We are ranked eighth for research power in the UK according to REF 2014. We have six beacons of research excellence helping to transform lives and change the world; we are also a major employer and industry partner - locally and globally.
Impact: The Nottingham Campaign, its biggest-ever fundraising campaign, is delivering the University’s vision to change lives, tackle global issues and shape the future. More news…