In this blog post, we will look at the principles and potential developments of eye-tracking technology that allows you to operate your smartphone without using your hands.
On a cold day, when you need to use a navigation app or search for a restaurant online, there are times when you don’t want to take your hands out of your pockets. In such moments, the cold isn’t the only thing that bothers you—even the smallest inconveniences feel magnified. On extremely tired days, you might not even want to move a finger. On days like these, even simple actions can feel like a huge burden. Then, the notification sound from a friend’s text message rings on your phone. Even unlocking the screen feels like a hassle, let alone replying to the message. But now, there’s a technology that can help us in such situations. Using eye-tracking technology, you can easily solve the problems mentioned earlier. This technology is no longer a futuristic concept seen only in science fiction movies; it has already begun to deeply integrate into our daily lives.
By attaching a simple device and running the software for this technology on any compatible device—such as a smartphone, tablet PC, or laptop—we can perform desired actions simply by moving our eyes, without touching the device. What are the principles behind eye-tracking technology, how advanced is it, and how much impact could it have on our lives?
Technology that recognizes and tracks eye movements has been researched in various ways. This technology is being actively developed in the field of human-computer interaction (HCI) and its applications are continuing to expand. For example, in the field of education, this technology can be used to analyze students’ eye movements to identify learning patterns and provide customized educational content.
First, there is a method that tracks eye movements by recognizing the contours of the eyes and the pupil area. This method involves capturing the entire input face image and then performing an eye region detection algorithm based on contrasts in unique features such as skin color, eye color, and shape to track eye movements. However, this method is limited in accuracy as it roughly infers the direction of gaze based on computer algorithms rather than accurately predicting the exact direction. While this approach is cost-effective and useful for initial research or simple applications, its limitations become evident in tasks requiring precision.
The second method uses an infrared camera and an infrared light source. When two infrared light sources are shone onto the pupil, the light reflected from the cornea is recognized by the camera, and the final gaze is estimated using a simple gaze tracking algorithm. The user first looks at two points to set the basic elements of the gaze tracking algorithm, and from then on, the camera automatically tracks the gaze according to the algorithm. This system is flexible because there is no need to fix the camera or the user’s head in place. This technology plays an important role in providing realistic virtual reality (VR) experiences. For example, when the user moves their gaze, the viewpoint in the virtual environment also moves naturally, providing a more immersive experience.
Finally, there is a method that uses the AdaBoost algorithm. This algorithm is often used in smartphones that use eye tracking technology. This algorithm connects the results of other basic learning algorithms by assigning weights to them, making it specialized for tasks that require multiple repetitive inputs, such as eye tracking. After recognizing the user’s gaze through a camera, the AdaBoost algorithm calculates the precise movement of the gaze and integrates it with smartphone operating systems like Android to perform tasks.
The development of such algorithms enables eye-tracking technology to go beyond simply processing visual data and provide more sophisticated interfaces by more accurately understanding the user’s intentions. Eye recognition technology is not just in the development stage, but has already been commercialized and is available for purchase. A representative focus tracking product is Eye Tribe.
According to Sune Johansen, founder of The Eye Tribe, Eye Tribe is a technology name and product name that refers to technology that reads the movement of the human eye and performs various actions on the screen through software that can be installed on electronic devices and a miniature focus recognition camera that can be attached to devices. Eye Tribe is developed using C and Java, two highly compatible programming languages, making it compatible with most common operating systems such as Windows 7 and Android. The Eye Tribe website demonstrates how the technology can be used to operate a map application on a smartphone or run simple mobile games, all through eye movements alone.
The potential for development of eye tracking technology is very bright, as it does not rely solely on the precision of measurement devices. Instead, as better algorithms are developed, more precise eye tracking and the execution of complex actions become possible. Such developments hold particularly significant potential in industrial settings. For example, in the medical field, surgeons will be able to operate medical devices with just their eyes, contributing to increased accuracy and efficiency in surgery. In other words, if someone develops a more innovative algorithm, it may be possible to perform complex actions such as typing letters or drawing pictures, rather than just simple functions like switching screens or dragging. However, research on eye fatigue and the possibility of eye diseases is insufficient compared to the speed at which eye-tracking technology is developing. Therefore, it is necessary to consider the impact of eye-tracking technology on users, rather than relying solely on improvements in algorithms and devices.
As future societies pursue greater convenience, the importance of focus recognition technology is increasing. The ability to perform desired actions without touching anything will be appealing to modern people. For example, accidents often occur while driving due to operating navigation or vehicle devices. Eye-tracking technology could be used to improve driver convenience in such situations. If simple functions such as playing music or zooming in on a map can be performed by adjusting focus and gaze, drivers would likely view eye-tracking technology very positively. However, it is important to prevent eye tracking technology from distracting drivers while driving. To prevent this, it is important to design eye tracking technology that analyzes the driver’s gaze patterns to prevent distraction.
From what we have seen so far, eye tracking technology has been researched in various directions, such as infrared reflection and focus recognition technology, and new eye tracking methods continue to emerge. With the ongoing development of algorithms, the technology is advancing day by day, and the scope of application of eye tracking technology is also expanding. Eye tracking technology can be of great help not only to the general public but also to people with physical disabilities. For example, individuals who use wheelchairs or have casts on their hands are restricted in performing actions that require the use of their hands, and eye tracking technology can contribute to resolving such issues. By integrating with wireless communication technology, new interfaces could be developed, such as operating elevator buttons with just eye movements or ordering desired items by focusing on a nearby device without physically touching them. The development of such everyday, simple interfaces has the potential to grow into a field that provides new forms of services for individuals with physical disabilities.