next up previous contents
Next: Summary and Conclusions Up: No Title Previous: Contents

Introduction

 

%As one of the most successful applications of image analysis and understanding, face recognition has recently received significant attention, especially during the past few years. This is evidenced by the emergence of face recognition conferences such as AVBPA [1] and AFGR [2], systematic empirical evaluations of face recognition techniques (FRT), including the FERET  [126, 129, 125] FRVT 2000 [25], FRVT 2002 [122], and XM2VTS [108] protocols, and many commercially available systems (Table gif). There are at least two reasons for this trend; the first is the wide range of commercial and law enforcement applications and the second is the availability of feasible technologies after 30 years of research. In addition, the problem of machine recognition of human faces continues to attract researchers from disciplines such as image processing, pattern recognition, neural networks, computer vision, computer graphics, and psychology.

The strong need for user-friendly systems that can secure our assets and protect our privacy without losing our identity in a sea of numbers is obvious. At present, one needs a PIN to get cash from an ATM, a password for a computer, a dozen others to access the internet, and so on. Although very reliable methods of biometric personal identification exist, e.g., fingerprint analysis and retinal or iris scans, these methods rely on the cooperation of the participants, whereas a personal identification system based on analysis of frontal or profile images of the face is often effective without the participant's cooperation or knowledge. Some of the advantages/disadvantages of different biometrics are described in [124]. Table gif lists some of the applications of face recognition.

   table26
Table: Typical applications of face recognition.

Commercial and law enforcement applications of FRT range from static, controlled-format photographs to uncontrolled video images, posing a wide range of technical challenges and requiring an equally wide range of techniques from image processing, analysis, understanding and pattern recognition. One can broadly classify FRT systems into two groups depending on whether they make use of static images or of video. Within these groups, significant differences exist, depending on the specific application. The differences are in terms of image quality, amount of background clutter (posing challenges to segmentation algorithms), variability of the images of a particular individual that must be recognized, availability of a well-defined recognition or matching criterion, and the nature, type and amount of input from a user. A list of some commercial systems is given in Table gif.

   table49
Table: Available commercial face recognition systems. (Some of these websites may have changed or been removed.) [The identification of any company, commercial product, or trade name does not imply endorsement or recommendation by the National Institute of Standards and Technology or any of the authors or their institutions.]

A general statement of the problem of machine recognition of faces can be formulated as follows: Given still or video images of a scene, identify or verify one or more persons in the scene using a stored database of faces. Available collateral information such as race, age, gender, facial expression, or speech may be used in narrowing the search (enhancing recognition). The solution to the problem involves segmentation of faces (face detection) from cluttered scenes, feature extraction from the face regions, recognition, or verification (Fig. gif). In identification problems, the input to the system is an unknown face, and the system reports back the determined identity from a database of known individuals, whereas in verification problems, the system needs to confirm or reject the claimed identity of the input face.

   figure61
Figure: Configuration of a generic face recognition system.

Face perception is an important part of the capability of human perception system and is a routine task for humans, while building a similar computer system is still an on-going research area. The earliest work on face recognition can be traced back at least to the 1950's in psychology [35] and to the 1960's in the engineering literature [28]. (Some of the earliest studies include work on facial expression of emotions by Darwin [44] (see also Ekman [48]) and on facial profile based biometrics by Galton [54]). But research on automatic machine recognition of faces really started in the 1970's after the seminal work of Kanade [80] and Kelly [81]. Over the past thirty years extensive research has been conducted by psychophysicists, neuroscientists, and engineers on various aspects of face recognition by humans and machines. Psychophysicists and neuroscientists have been concerned with issues such as whether face perception is a dedicated process (this issue is still being debated in the psychology community [49, 21, 56, 55]) and whether it is done holistically or by local feature analysis.

Many of the hypotheses and theories put forward by researchers in these disciplines have been based on rather small sets of images. Nevertheless, many of the findings have important consequences for engineers who design algorithms and systems for machine recognition of human faces. Section 2 will present a concise review of these findings.

Barring a few exceptions that use range data [62], the face recognition problem has been formulated as recognizing 3D objects from 2D images gif. Earlier approaches treated it as a 2D pattern recognition problem. As a result, during the early and mid-1970's, typical pattern classification techniques, which use measured attributes of features (e.g. the distances between important points) in faces or face profiles, were used [28, 81, 80]. During the 1980's, work on face recognition remained largely dormant. Since the early 1990's, research interest in FRT has grown significantly. One can attribute this to several reasons: An increase in interest in commercial opportunities; the availability of real-time hardware; and the increasing importance of surveillance-related applications.

Over the past 15 years, research has focused on how to make face recognition systems fully automatic by tackling problems such as localization of a face in a given image or video clip and extraction of features such as eyes, mouth, etc. Meanwhile, significant advances have been made in the design of classifiers for successful face recognition. Among appearance-based holistic approaches, eigenfaces [82, 153] and Fisherfaces [12, 50, 168] have proved to be effective in experiments with large databases. Feature-based graph matching approaches [160] have also been quite successful. Compared to holistic approaches, feature-based methods are less sensitive to variations in illumination and viewpoint and to inaccuracy in face localization. However, the feature extraction techniques needed for this type of approach are still not reliable or accurate enough [42]. For example, most eye localization techniques assume some geometric and textural models and do not work if the eye is closed. Section 3 will present a review of still-image based face recognition.

During the past five to eight years, much research has been concentrated on video-based face recognition. The still image problem has several inherent advantages and disadvantages. For applications such as drivers' licenses, due to the controlled nature of the image acquisition process, the segmentation problem is rather easy. However, if only a static picture of an airport scene is available, automatic location and segmentation of a face could pose serious challenges to any segmentation algorithm. On the other hand, if a video sequence is available, segmentation of a moving person can be more easily accomplished using motion as a cue. But the small size and low image quality of faces captured from video can significantly increase the difficulty in recognition. Video-based face recognition is reviewed in Section 4.

As we propose new algorithms and build more systems, measuring the performance of new systems and of existing systems becomes very important. Systematic data collection and evaulation of face recognition systems is reviewed in Section 5.

Recognizing a 3D object from its 2D images poses many challenges. The illumination and pose problems are two prominent issues for appearance- or image-based approaches. Many approaches have been proposed to handle these issues, with the majority of them exploring domain knowledge. Details of these approaches are discussed in Section 6.

In 1995, a review paper [37] gave a thorough survey of FRT at that time. (An earlier survey [134] appeared in 1992.) At that time, video-based face recognition was still in a nascent stage. During the past eight years, face recognition has received increased attention and has advanced technically. Many commercial systems for still face recognition are now available. Recently, significant research efforts have been focused on video-based face modeling/tracking, recognition, and system integration. New datasets have been created and evaluations of recognition techniques using these databases have been carried out. It is not an overstatement to say that face recognition has become one of the most active applications of pattern recognition, image analysis and understanding.

In this paper we provide a critical review of current developments in face recognition. This paper is organized as follows: In Section 2 we briefly review issues that are relevant from a psychophysical point of view. Section 3 provides a detailed review of recent developments in face recognition techniques using still images. In Section 4 face recognition techniques based on video are reviewed. Data collection and performance evaluation of face recognition algorithms are addressed in Section 5 with descriptions of representative protocols. In Section 6 we discuss two important problems in face recognition that can be mathematically studied, lack of robustness to illumination and pose variations, and we review proposed methods of overcoming these limitations. Finally, a summary and conclusions are presented in Section 7.


next up previous contents
Next: Summary and Conclusions Up: No Title Previous: Contents

Wenyi Zhao
Mon Mar 8 16:07:04 EST 2004